• search
    search
  • reviewers
    reviewers
  • feeds
    feeds
  • assignments
    assignments
  • settings
  • logout

Leveraging Unimodal Self-Supervised Learning for Multimodal Audio-Visual Speech Recognition.

Xichen PanPeiyu ChenYichen GongHelong ZhouXinbing WangZhouhan Lin
Published in: CoRR (2022)
Keyphrases
  • audio visual speech recognition
  • multi stream
  • dimensionality reduction
  • multi modal
  • information retrieval
  • feature selection
  • em algorithm
  • mixture model
  • audio visual