Mementos: A Comprehensive Benchmark for Multimodal Large Language Model Reasoning over Image Sequences.
Xiyao Wang, Yuhang Zhou, Xiaoyu Liu, Hongjin Lu, Yuancheng Xu, Feihong He, Jaehong Yoon, Taixi Lu, Fuxiao Liu, Gedas Bertasius, Mohit Bansal, Huaxiu Yao, Furong Huang
Published in: ACL (1) (2024)
Keyphrases