Login / Signup

Self-Distilled Quantization: Achieving High Compression Rates in Transformer-Based Language Models.

James O'NeillSourav Dutta
Published in: ACL (2) (2023)
Keyphrases