OliVe: Accelerating Large Language Models via Hardware-friendly Outlier-Victim Pair Quantization.

Cong Guo, Jiaming Tang, Weiming Hu, Jingwen Leng, Chen Zhang, Fan Yang, Yunxin Liu, Minyi Guo, Yuhao Zhu
Published in: CoRR (2023)