SoftCLIP: Softer Cross-modal Alignment Makes CLIP Stronger.

Yuting Gao, Jinfeng Liu, Zihan Xu, Tong Wu, Wei Liu, Jie Yang, Ke Li, Xing Sun
Published in: CoRR (2023)
Keyphrases
  • cross modal
  • multi modal
  • multimedia retrieval
  • visual recognition
  • perceptual information
  • visual similarity
  • visual data
  • high level
  • image retrieval
  • image understanding
  • video clips
  • multimedia databases