Login / Signup

BESA: Pruning Large Language Models with Blockwise Parameter-Efficient Sparsity Allocation.

Peng XuWenqi ShaoMengzhao ChenShitao TangKaipeng ZhangPeng GaoFengwei AnYu QiaoPing Luo
Published in: CoRR (2024)
Keyphrases