Constructing Multilingual Visual-Text Datasets Revealing Visual Multilingual Ability of Vision Language Models.

Jesse Atuhurra, Iqra Ali, Tatsuya Hiraoka, Hidetaka Kamigaito, Tomoya Iwakura, Taro Watanabe
Published in: CoRR (2024)
Keyphrases