Sign in

A Unified Multi-modal Structure for Retrieving Tracked Vehicles through Natural Language Descriptions.

Dong XieLinhu LiuShengjun ZhangJiang Tian
Published in: CVPR Workshops (2023)
Keyphrases
  • multi modal
  • natural language descriptions
  • multi modality
  • real time
  • high dimensional
  • cross modal
  • audio visual
  • machine learning
  • similarity measure
  • graph cuts
  • image annotation
  • video search
  • uni modal