All in One: Exploring Unified Vision-Language Tracking with Multi-Modal Alignment.

Chunhui Zhang, Xin Sun, Li Liu, Yiqian Yang, Qiong Liu, Xi Zhou, Yanfeng Wang
Published in: CoRR (2023)
Keyphrases
  • multi modal
  • real time
  • particle filter
  • computer vision
  • cross modal
  • multi modality
  • high dimensional
  • image annotation
  • image processing
  • audio visual
  • semantic concepts
  • video search
  • higher level