Login / Signup
Vision Model Pre-training on Interleaved Image-Text Data via Latent Compression Learning.
Chenyu Yang
Xizhou Zhu
Jinguo Zhu
Weijie Su
Junjie Wang
Xuan Dong
Wenhai Wang
Lewei Lu
Bin Li
Jie Zhou
Yu Qiao
Jifeng Dai
Published in:
CoRR (2024)
Keyphrases
</>
text data
latent variable models
image representation
image segmentation
image data
probabilistic model
learning algorithm
computer vision
supervised learning
input image
training set
email
active learning
data analysis
pattern recognition
multiscale
metadata
search engine