VEGA: Learning Interleaved Image-Text Comprehension in Vision-Language Large Models.
Chenyu Zhou
Mengdan Zhang
Peixian Chen
Chaoyou Fu
Yunhang Shen
Xiawu Zheng
Xing Sun
Rongrong Ji
Published in:
CoRR (2024)
Keyphrases
prior knowledge
learning algorithm
learning models
image retrieval
image data
image analysis
image content
bayesian framework
input image
image features
high resolution
learning process
computer vision
image segmentation
image classification
single image
learned models
accurate models
visual perception
information retrieval
language learning
cognitive models
learning tasks
image representation
supervised learning
probabilistic model
image processing
document images
image regions
image collections
natural language