Meta-Explore: Exploratory Hierarchical Vision-and-Language Navigation Using Scene Object Spectrum Grounding.
Minyoung HwangJaeyeon JeongMinsoo KimYoonseon OhSonghwai OhPublished in: CVPR (2023)
Keyphrases
- multiple objects
- complex scenes
- moving objects
- visual scene
- object models
- image regions
- real objects
- computer vision
- d scene
- visual input
- spatial relations
- d objects
- relative position
- programming language
- geometric information
- real world scenes
- ground plane
- real world objects
- object model
- uncalibrated images
- location and orientation
- camera images
- vision system
- reference object
- video sequences
- target object
- object features
- dynamic scenes
- object parts
- laser scanner
- object appearance
- three dimensional
- video scene
- intensity images
- multiple images
- object tracking
- natural language
- spatial location
- object motion
- meta level
- wire frame
- object segmentation
- appearance model
- image segments
- acquired images
- range images
- rigid body motion
- viewing position