Login / Signup

VEGA: Learning Interleaved Image-Text Comprehension in Vision-Language Large Models.

Chenyu ZhouMengdan ZhangPeixian ChenChaoyou FuYunhang ShenXiawu ZhengXing SunRongrong Ji
Published in: CoRR (2024)
Keyphrases