Cross-Modal Global Interaction and Local Alignment for Audio-Visual Speech Recognition.
Yuchen Hu
Ruizhe Li
Chen Chen
Heqing Zou
Qiushi Zhu
Eng Siong Chng
Published in:
CoRR (2023)
Keyphrases
cross modal
audio visual speech recognition
multi modal
multimedia retrieval
multi stream
visual recognition
human computer interaction
audio visual
image retrieval
multimedia databases
visual data
multimedia
feature space
image data
image database
visual similarity