Login / Signup

Towards Grounded Visual Spatial Reasoning in Multi-Modal Vision Language Models.

Navid RajabiJana Kosecka
Published in: CoRR (2023)
Keyphrases