Advancing Large Multi-modal Models with Explicit Chain-of-Reasoning and Visual Question Generation.
Kohei UeharaNabarun GoswamiHanqin WangToshiaki BabaKohtaro TanakaTomohiro HashimotoKai WangRei ItoTakagi NaoyaRyo UmagamiYingyi WenTanachai AnakewatTatsuya HaradaPublished in: CoRR (2024)