All in One: Exploring Unified Vision-Language Tracking with Multi-Modal Alignment.
Chunhui Zhang
Xin Sun
Yiqian Yang
Li Liu
Qiong Liu
Xi Zhou
Yanfeng Wang
Published in:
ACM Multimedia (2023)
Keyphrases
multi modal
real time
computer vision
audio visual
multi modality
image processing
semantic concepts
particle filter
cross modal
high dimensional
appearance model
video search
object recognition
image annotation
humanoid robot