Speaker-Targeted Audio-Visual Speech Recognition Using a Hybrid CTC/Attention Model with Interference Loss.

Published in: ICASSP (2022)

Keyphrases