Align-to-Distill: Trainable Attention Alignment for Knowledge Distillation in Neural Machine Translation
Heegon Jin, Seonil Son, Jemin Park, Youngseok Kim, Hyungjong Noh, Yeonsoo Lee. Published in: LREC/COLING (2024)
Keyphrases
- machine translation
- word alignment
- language independent
- cross lingual
- word level
- information extraction
- target language
- natural language generation
- natural language
- natural language processing
- cross language information retrieval
- language processing
- language resources
- word sense disambiguation
- knowledge representation
- grammar induction
- knowledge base
- statistical machine translation
- machine translation system
- markov networks
- parallel corpora
- knowledge sources
- document retrieval
- information retrieval
- brazilian portuguese
- statistical translation models