C
search
search
reviewers
reviewers
feeds
feeds
assignments
assignments
settings
logout
VELMA: Verbalization Embodiment of LLM Agents for Vision and Language Navigation in Street View.
Raphael Schumann
Wanrong Zhu
Weixi Feng
Tsu-Jui Fu
Stefan Riezler
William Yang Wang
Published in:
CoRR (2023)
Keyphrases
</>
street view
multi agent systems
multi agent
computer vision
vision system
autonomous agents
text detection