Grounding Conversational Robots on Vision Through Dense Captioning and Large Language Models.

Published in: ICRA (2024)

Keyphrases