Login / Signup
X-CLIP: End-to-End Multi-grained Contrastive Learning for Video-Text Retrieval.
Yiwei Ma
Guohai Xu
Xiaoshuai Sun
Ming Yan
Ji Zhang
Rongrong Ji
Published in:
ACM Multimedia (2022)
Keyphrases
</>
end to end
text retrieval
learning process
video clips
real time
information retrieval
reinforcement learning
active learning
learning algorithm
computational complexity
image retrieval
document collections
inverted file
application layer
scalable video
text localization and recognition