Unified Cross-Modal Attention: Robust Audio-Visual Speech Recognition and Beyond.
Jiahong Li
Chenda Li
Yifei Wu
Yanmin Qian
Published in:
IEEE/ACM Trans. Audio Speech Lang. Process. (2024)
Keyphrases
cross modal
audio visual speech recognition
multi modal
multimedia retrieval
visual data
visual recognition
multi stream
contextual information
multimedia databases
noisy environments