Login / Signup
CLIPSelf: Vision Transformer Distills Itself for Open-Vocabulary Dense Prediction.
Size Wu
Wenwei Zhang
Lumin Xu
Sheng Jin
Xiangtai Li
Wentao Liu
Chen Change Loy
Published in:
ICLR (2024)
Keyphrases
</>
prediction accuracy
computer vision
prediction model
real time
fuzzy logic
prediction algorithm
vision system
prediction error
image processing
evolutionary algorithm
scene flow
stereo correspondence
predictive model
three dimensional
metadata
artificial intelligence
data sets