Login / Signup

Revisiting Intermediate Layer Distillation for Compressing Language Models: An Overfitting Perspective.

Jongwoo KoSeungjoon ParkMinchan JeongSukjin HongEuijai AhnDu-Seong ChangSe-Young Yun
Published in: CoRR (2023)
Keyphrases