Login / Signup

LoRAP: Transformer Sub-Layers Deserve Differentiated Structured Compression for Large Language Models.

Guangyan LiYongqiang TangWensheng Zhang
Published in: CoRR (2024)
Keyphrases