Learning Video Temporal Dynamics with Cross-Modal Attention for Robust Audio-Visual Speech Recognition.
Sungnyun Kim
Kangwook Jang
Sangmin Bae
Hoirin Kim
Se-Young Yun
Published in:
CoRR (2024)
Keyphrases
cross modal
visual recognition
video sequences
multi modal
multimedia
video data
multimedia databases