Login / Signup
Atom: Low-bit Quantization for Efficient and Accurate LLM Serving.
Yilong Zhao
Chien-Yu Lin
Kan Zhu
Zihao Ye
Lequn Chen
Size Zheng
Luis Ceze
Arvind Krishnamurthy
Tianqi Chen
Baris Kasikci
Published in:
CoRR (2023)
Keyphrases
</>
computationally efficient
database
high quality
bit wise
real world
case study
image quality
visual quality