Gradient-based Intra-attention Pruning on Pre-trained Language Models.

Ziqing Yang, Yiming Cui, Xin Yao, Shijin Wang
Published in: CoRR (2022)