ManipVQA: Injecting Robotic Affordance and Physically Grounded Information into Multi-Modal Large Language Models.

Published in: CoRR (2024)

Keyphrases