Multilingual Audio-Visual Speech Recognition with Hybrid CTC/RNN-T Fast Conformer.

Published in: ICASSP (2024)

Keyphrases