Login / Signup

Unified Cross-Modal Attention: Robust Audio-Visual Speech Recognition and Beyond.

Jiahong LiChenda LiYifei WuYanmin Qian
Published in: IEEE ACM Trans. Audio Speech Lang. Process. (2024)
Keyphrases
  • cross modal
  • audio visual speech recognition
  • multi modal
  • multimedia retrieval
  • visual data
  • visual recognition
  • multi stream
  • contextual information
  • multimedia databases
  • noisy environments