Cross-Modal Global Interaction and Local Alignment for Audio-Visual Speech Recognition.
Yuchen Hu
Ruizhe Li
Chen Chen
Heqing Zou
Qiushi Zhu
Eng Siong Chng
Published in:
IJCAI (2023)
Keyphrases
cross modal
audio visual speech recognition
multi modal
multi stream
audio visual
multimedia retrieval
human computer interaction
multimedia databases
image retrieval
low level
probabilistic model
visual data
visual recognition