Open-ended VQA benchmarking of Vision-Language models by exploiting Classification datasets and their semantic hierarchy.

Simon Ging, María Alejandra Bravo, Thomas Brox
Published in: CoRR (2024)
Keyphrases