Compressing Pre-trained Transformers via Low-Bit NxM Sparsity for Natural Language Understanding.

Connor Holmes, Minjia Zhang, Yuxiong He, Bo Wu
Published in: CoRR (2022)
Keyphrases