Login / Signup

Towards Lossless Head Pruning through Automatic Peer Distillation for Language Models.

Bingbing LiZigeng WangShaoyi HuangMikhail A. BraginJi LiCaiwen Ding
Published in: IJCAI (2023)
Keyphrases