Leveraging Unimodal Self-Supervised Learning for Multimodal Audio-Visual Speech Recognition.

Published in: CoRR (2022)

Keyphrases