APTQ: Attention-aware Post-Training Mixed-Precision Quantization for Large Language Models.

Ziyi Guan, Hantao Huang, Yupeng Su, Hong Huang, Ngai Wong, Hao Yu
Published in: CoRR (2024)
Keyphrases