Login / Signup

Sparse Progressive Distillation: Resolving Overfitting under Pretrain-and-Finetune Paradigm.

Shaoyi HuangDongkuan XuIan En-Hsu YenYijue WangSung-En ChangBingbing LiShiyang ChenMimi XieSanguthevar RajasekaranHang LiuCaiwen Ding
Published in: ACL (1) (2022)
Keyphrases