Login / Signup

Leveraging Unimodal Self-Supervised Learning for Multimodal Audio-Visual Speech Recognition.

Xichen PanPeiyu ChenYichen GongHelong ZhouXinbing WangZhouhan Lin
Published in: CoRR (2022)
Keyphrases
  • audio visual speech recognition
  • multi stream
  • dimensionality reduction
  • multi modal
  • information retrieval
  • feature selection
  • em algorithm
  • mixture model
  • audio visual