Login / Signup
All in One: Exploring Unified Vision-Language Tracking with Multi-Modal Alignment.
Chunhui Zhang
Xin Sun
Li Liu
Yiqian Yang
Qiong Liu
Xi Zhou
Yanfeng Wang
Published in:
CoRR (2023)
Keyphrases
</>
multi modal
real time
particle filter
computer vision
cross modal
multi modality
high dimensional
image annotation
image processing
audio visual
semantic concepts
video search
higher level