Login / Signup
A Speed Odyssey for Deployable Quantization of LLMs.
Qingyuan Li
Ran Meng
Yiduo Li
Bo Zhang
Liang Li
Yifan Lu
Xiangxiang Chu
Yerui Sun
Yuchen Xie
Published in:
CoRR (2023)
Keyphrases
</>
high speed
multi agent systems
processing speed
quantization error
data sets
learning algorithm
decision trees
data structure
bits per pixel