Efficient Machine Translation with Model Pruning and Quantization.
Maximiliana BehnkeNikolay BogoychevAlham Fikri AjiKenneth HeafieldGraeme NailQianqian ZhuSvetlana TchistiakovaJelmer van der LindePinzhen ChenSidharth KashyapRoman GrundkiewiczPublished in: WMT@EMNLP (2021)