Login / Signup
X-CLIP: End-to-End Multi-grained Contrastive Learning for Video-Text Retrieval.
Yiwei Ma
Guohai Xu
Xiaoshuai Sun
Ming Yan
Ji Zhang
Rongrong Ji
Published in:
CoRR (2022)
Keyphrases
</>
end to end
text retrieval
learning process
scalable video
image retrieval
video clips
document retrieval
retrieval model
real time
video streams
key frames
congestion control
application layer
wireless networks
document collections
video sequences
multimedia