Login / Signup
Dual Grained Quantization: Efficient Fine-Grained Quantization for LLM.
Luoming Zhang
Wen Fei
Weijia Wu
Yefei He
Zhenyu Lou
Hong Zhou
Published in:
CoRR (2023)
Keyphrases
</>
fine grained
coarse grained
massively parallel
access control
tightly coupled
quantization error
databases
search engine
data structure
user intent