Login / Signup
How Much Can CLIP Benefit Vision-and-Language Tasks?
Sheng Shen
Liunian Harold Li
Hao Tan
Mohit Bansal
Anna Rohrbach
Kai-Wei Chang
Zhewei Yao
Kurt Keutzer
Published in:
ICLR (2022)
Keyphrases
</>
computer vision
visually guided
natural language
programming language
multiple tasks
image processing
real time
vision system
transfer learning
general purpose
website
language learning
video content
context dependent
language processing
english language
data mining
real world