GKD: Generalized Knowledge Distillation for Auto-regressive Sequence Models.

Rishabh Agarwal, Nino Vieillard, Piotr Stanczyk, Sabela Ramos, Matthieu Geist, Olivier Bachem
Published in: CoRR (2023)