Enhancing Cross-Modal Understanding for Audio Visual Scene-Aware Dialog Through Contrastive Learning.
Feifei Xu
Wang Zhou
Guangzhen Li
Zheng Zhong
Yingchen Zhou
Published in:
ISCAS (2024)
Keyphrases
cross modal
multi modal
real time
learning tasks
visual data
visual recognition
search engine
computer vision
image processing
multimedia
multimedia retrieval