Login / Signup

Pruning Large Language Models to Intra-module Low-rank Architecture with Transitional Activations.

Bowen ShenZheng LinDaren ZhaWei LiuJian LuanBin WangWeiping Wang
Published in: CoRR (2024)
Keyphrases