Login / Signup

UniIR: Training and Benchmarking Universal Multimodal Information Retrievers.

Cong WeiYang ChenHaonan ChenHexiang HuGe ZhangJie FuAlan RitterWenhu Chen
Published in: CoRR (2023)
Keyphrases
  • multimodal information
  • visual data
  • training set
  • video data
  • feature selection
  • high level
  • video sequences
  • nearest neighbor
  • natural language processing