PaD: Program-aided Distillation Can Teach Small Models Reasoning Better than Chain-of-thought Fine-tuning.

Published in: NAACL-HLT (2024)

Keyphrases