Enhancing Human-like Multi-Modal Reasoning: A New Challenging Dataset and Comprehensive Framework.

Jingxuan Wei, Cheng Tan, Zhangyang Gao, Linzhuang Sun, Siyuan Li, Bihui Yu, Ruifeng Guo, Stan Z. Li
Published in: CoRR (2023)
Keyphrases
  • multi modal
  • multi modality
  • cross modal
  • uni modal
  • high dimensional
  • low level
  • audio visual
  • computer vision
  • high level
  • object recognition
  • semantic concepts
  • multiple modalities