Login / Signup
CLIP-ViP: Adapting Pre-trained Image-Text Model to Video-Language Alignment.
Hongwei Xue
Yuchong Sun
Bei Liu
Jianlong Fu
Ruihua Song
Houqiang Li
Jiebo Luo
Published in:
ICLR (2023)
Keyphrases
</>
pre trained
statistical model
image classification
input image
image segmentation
probabilistic model
video clips
data sets
feature selection
image retrieval
training examples
key frames