Sign in

Enhancing Multimodal Compositional Reasoning of Visual Language Models with Generative Negative Mining.

Ugur SahinHang LiQadeer KhanDaniel CremersVolker Tresp
Published in: CoRR (2023)
Keyphrases