Login / Signup

Vision Model Pre-training on Interleaved Image-Text Data via Latent Compression Learning.

Chenyu YangXizhou ZhuJinguo ZhuWeijie SuJunjie WangXuan DongWenhai WangLewei LuBin LiJie ZhouYu QiaoJifeng Dai
Published in: CoRR (2024)
Keyphrases