Login / Signup

OliVe: Accelerating Large Language Models via Hardware-friendly Outlier-Victim Pair Quantization.

Cong GuoJiaming TangWeiming HuJingwen LengChen ZhangFan YangYunxin LiuMinyi GuoYuhao Zhu
Published in: ISCA (2023)
Keyphrases