Login / Signup

Extreme Compression of Large Language Models via Additive Quantization.

Vage EgiazarianAndrei PanferovDenis KuznedelevElias FrantarArtem BabenkoDan Alistarh
Published in: CoRR (2024)
Keyphrases