Login / Signup

Self-Distilled Quantization: Achieving High Compression Rates in Transformer-Based Language Models.

James O'NeillSourav Dutta
Published in: CoRR (2023)
Keyphrases