Login / Signup
VELMA: Verbalization Embodiment of LLM Agents for Vision and Language Navigation in Street View.
Raphael Schumann
Wanrong Zhu
Weixi Feng
Tsu-Jui Fu
Stefan Riezler
William Yang Wang
Published in:
AAAI (2024)
Keyphrases
</>
street view
multi agent
multi agent systems
computer vision
autonomous agents
vision system
text detection
object recognition