Login / Signup

WKVQuant: Quantizing Weight and Key/Value Cache for Large Language Models Gains More.

Yuxuan YueZhihang YuanHaojie DuanmuSifan ZhouJianlong WuLiqiang Nie
Published in: CoRR (2024)
Keyphrases