VELMA: Verbalization Embodiment of LLM Agents for Vision and Language Navigation in Street View.

Published in: AAAI (2024)

Keyphrases