Accelerating Inference and Language Model Fusion of Recurrent Neural Network Transducers via End-to-End 4-bit Quantization.

Andrea Fasoli, Chia-Yu Chen, Mauricio J. Serrano, Swagath Venkataramani, George Saon, Xiaodong Cui, Brian Kingsbury, Kailash Gopalakrishnan
Published in: INTERSPEECH (2022)
Keyphrases