Improving task-agnostic BERT distillation with layer mapping search.

Xiaoqi Jiao, Huating Chang, Yichun Yin, Lifeng Shang, Xin Jiang, Xiao Chen, Linlin Li, Fang Wang, Qun Liu
Published in: Neurocomputing (2021)