Integer Scale: A Free Lunch for Faster Fine-grained Quantization of LLMs.
Qingyuan Li
Ran Meng
Yiduo Li
Bo Zhang
Yifan Lu
Yerui Sun
Lin Ma
Yuchen Xie
Published in:
CoRR (2024)
Keyphrases
fine grained
coarse grained
access control
tightly coupled
quantization error
scale space
databases
multiscale
massively parallel