CODIS: Benchmarking Context-Dependent Visual Comprehension for Multimodal Large Language Models.
Fuwen LuoChi ChenZihao WanZhaolu KangQidong YanYingjie LiXiaolong WangSiyu WangZiyue WangXiaoyue MiPeng LiNing MaMaosong SunYang LiuPublished in: CoRR (2024)
Keyphrases