Robust Self-Supervised Audio-Visual Speech Recognition.

Bowen Shi, Wei-Ning Hsu, Abdelrahman Mohamed
Published in: INTERSPEECH (2022)
Keyphrases
  • audio visual speech recognition
  • multi stream
  • audio visual
  • noisy environments
  • multimedia
  • high dimensional