Structured Scene Memory for Vision-Language Navigation.
Hanqing WangWenguan WangWei LiangCaiming XiongJianbing ShenPublished in: CVPR (2021)
Keyphrases
- d scene
- computer vision
- video sequences
- scene understanding
- vision system
- programming language
- multiple images
- scene analysis
- image sequences
- language learning
- dynamic scenes
- visual scene
- single image
- natural language
- scene classification
- three dimensional
- real time
- real world
- indoor and outdoor
- memory usage
- indoor environments
- memory requirements
- object oriented
- moving objects
- real scenes
- image regions
- information space
- structured data
- object detection