Login / Signup

Enhancing Cross-Modal Understanding for Audio Visual Scene-Aware Dialog Through Contrastive Learning.

Feifei XuWang ZhouGuangzhen LiZheng ZhongYingchen Zhou
Published in: ISCAS (2024)
Keyphrases
  • cross modal
  • multi modal
  • real time
  • learning tasks
  • visual data
  • visual recognition
  • search engine
  • computer vision
  • image processing
  • multimedia
  • multimedia retrieval