Login / Signup

Agile-Quant: Activation-Guided Quantization for Faster Inference of LLMs on the Edge.

Xuan ShenPeiyan DongLei LuZhenglun KongZhengang LiMing LinChao WuYanzhi Wang
Published in: CoRR (2023)
Keyphrases