Login / Signup

Learning Video Temporal Dynamics with Cross-Modal Attention for Robust Audio-Visual Speech Recognition.

Sungnyun KimKangwook JangSangmin BaeHoirin KimSe-Young Yun
Published in: CoRR (2024)
Keyphrases
  • cross modal
  • visual recognition
  • video sequences
  • multi modal
  • multimedia
  • video data
  • multimedia databases