Sign in
Explore and Tell: Embodied Visual Captioning in 3D Environments.
Anwen Hu
Shizhe Chen
Liang Zhang
Qin Jin
Published in:
CoRR (2023)
Keyphrases
</>
visual information
visual features
visual cues
embodied cognition
data sets
multimedia
web services
image classification
dynamic environments
visual perception
visual representations
visual properties