I-LLM: Efficient Integer-Only Inference for Fully-Quantized Low-Bit Large Language Models.

Xing Hu, Yuan Cheng, Dawei Yang, Zhihang Yuan, Jiangyong Yu, Chen Xu, Sifan Zhou
Published in: CoRR (2024)
Keyphrases