Agile-Quant: Activation-Guided Quantization for Faster Inference of LLMs on the Edge.
Xuan Shen
Peiyan Dong
Lei Lu
Zhenglun Kong
Zhengang Li
Ming Lin
Chao Wu
Yanzhi Wang
Published in:
AAAI (2024)
Keyphrases
probabilistic inference
edge detection
edge information
bayesian networks
belief networks
edge detector
supply chain management
inference process
artificial intelligence
image processing
information processing
structured prediction
quantization error