Login / Signup

SpatialVLM: Endowing Vision-Language Models with Spatial Reasoning Capabilities.

Boyuan ChenZhuo XuSean KirmaniBrian IchterDanny DriessPete FlorenceDorsa SadighLeonidas J. GuibasFei Xia
Published in: CoRR (2024)
Keyphrases