This&That: Language-Gesture Controlled Video Generation for Robot Planning.
Boyang WangNikhil SridharChao FengMark Van der MerweAdam FishmanNima FazeliJeong Joon ParkPublished in: CoRR (2024)
Keyphrases
- mobile robot
- motion planning
- robotic agents
- goal directed
- video content
- video streams
- natural language
- computer controlled
- real time
- video data
- multimedia
- programming language
- humanoid robot
- navigation tasks
- human robot interaction
- video frames
- space time
- gesture recognition
- video analysis
- pointing gestures
- world model
- language learning
- video clips
- multi robot
- video sequences
- multiple robots
- heuristic search
- position and orientation
- service robots
- vision system
- hidden markov models
- text generation
- programming environment
- story generation
- action selection mechanism
- english text
- robot manipulators
- visual servoing
- robot navigation
- autonomous robots
- path planning
- robotic systems
- planning problems