QIGen: Generating Efficient Kernels for Quantized Inference on Large Language Models.

Tommaso Pegolotti, Elias Frantar, Dan Alistarh, Markus Püschel
Published in: CoRR (2023)