Login / Signup

Agile-Quant: Activation-Guided Quantization for Faster Inference of LLMs on the Edge.

Xuan ShenPeiyan DongLei LuZhenglun KongZhengang LiMing LinChao WuYanzhi Wang
Published in: AAAI (2024)
Keyphrases