ROME: Evaluating Pre-trained Vision-Language Models on Reasoning beyond Visual Common Sense.

Published in: EMNLP (Findings) (2023)

Keyphrases