MMSpeech: Multi-modal Multi-task Encoder-Decoder Pre-training for Speech Recognition.

Published in: INTERSPEECH (2023)

Keyphrases