Login / Signup

Do Vision & Language Decoders use Images and Text equally? How Self-consistent are their Explanations?

Letitia ParcalabescuAnette Frank
Published in: CoRR (2024)
Keyphrases