COTS: Collaborative Two-Stream Vision-Language Pre-Training Model for Cross-Modal Retrieval.

Haoyu Lu, Nanyi Fei, Yuqi Huo, Yizhao Gao, Zhiwu Lu, Ji-Rong Wen
Published in: CVPR (2022)
Keyphrases
  • cross modal
  • retrieval systems
  • training set
  • computer vision
  • object recognition
  • image retrieval
  • text retrieval
  • multimedia retrieval