OWQ: Outlier-Aware Weight Quantization for Efficient Fine-Tuning and Inference of Large Language Models.
Changhun Lee, Jungyu Jin, Taesu Kim, Hyungjun Kim, Eunhyeok Park. Published in: AAAI (2024)
Keyphrases
- language model
- fine-tuning
- language modeling
- n-gram
- language modelling
- document retrieval
- test collection
- probabilistic model
- query expansion
- speech recognition
- statistical language models
- retrieval model
- context-sensitive
- language model for information retrieval
- smoothing methods
- information retrieval
- relevance model
- weighting scheme
- viable alternative
- cross-lingual
- collaborative filtering
- hidden Markov models