Login / Signup

Beyond Embeddings: The Promise of Visual Table in Multi-Modal Models.

Yiwu ZhongZi-Yuan HuMichael R. LyuLiwei Wang
Published in: CoRR (2024)
Keyphrases
  • multi modal
  • cross modal
  • multi modality
  • video search
  • single modality
  • dimensionality reduction
  • visual information
  • image annotation
  • audio visual
  • humanoid robot