
ROME: Evaluating Pre-trained Vision-Language Models on Reasoning beyond Visual Common Sense.

Kankan Zhou, Eason Lai, Wei Bin Au Yeong, Kyriakos Mouratidis, Jing Jiang
Published in: CoRR (2023)