Explore and Tell: Embodied Visual Captioning in 3D Environments.

Anwen Hu, Shizhe Chen, Liang Zhang, Qin Jin
Published in: CoRR (2023)
Keyphrases
  • visual information
  • visual features
  • visual cues
  • embodied cognition
  • data sets
  • multimedia
  • web services
  • image classification
  • dynamic environments
  • visual perception
  • visual representations
  • visual properties