Login / Signup

ClawMachine: Fetching Visual Tokens as An Entity for Referring and Grounding.

Tianren MaLingxi XieYunjie TianBoyu YangYuan ZhangDavid S. DoermannQixiang Ye
Published in: CoRR (2024)
Keyphrases
  • visual information
  • low level
  • visual cues
  • neural network
  • website
  • visual features
  • visual perception
  • high level
  • mobile robot
  • social networks
  • line segments
  • spatial relations
  • visual analysis
  • visual tasks