Sign in

Blockwise Compression of Transformer-based Models without Retraining.

Gaochen DongWei Chen
Published in: CoRR (2023)
Keyphrases