Login / Signup

Visual Cropping Improves Zero-Shot Question Answering of Multimodal Large Language Models.

Jiarui ZhangMahyar KhayatkhoeiPrateek ChhikaraFilip Ilievski
Published in: CoRR (2023)
Keyphrases