PaD: Program-aided Distillation Can Teach Small Models Reasoning Better than Chain-of-thought Fine-tuning.

Xuekai Zhu, Biqing Qi, Kaiyan Zhang, Xinwei Long, Zhouhan Lin, Bowen Zhou
Published in: NAACL-HLT (2024)
Keyphrases