On-Policy Distillation of Language Models: Learning from Self-Generated Mistakes.

Published in: ICLR (2024)

Keyphrases