Login / Signup

SDQ: Sparse Decomposed Quantization for LLM Inference.

Geonhwa JeongPo-An TsaiStephen W. KecklerTushar Krishna
Published in: CoRR (2024)
Keyphrases