Login / Signup

VisionLLM v2: An End-to-End Generalist Multimodal Large Language Model for Hundreds of Vision-Language Tasks.

Jiannan WuMuyan ZhongSen XingZeqiang LaiZhaoyang LiuWenhai WangZhe ChenXizhou ZhuLewei LuTong LuPing LuoYu QiaoJifeng Dai
Published in: CoRR (2024)
Keyphrases