Multi-Stage Multi-Modal Pre-Training for Automatic Speech Recognition.

Published in: LREC/COLING (2024)

Keyphrases