Login / Signup
ClawMachine: Fetching Visual Tokens as An Entity for Referring and Grounding.
Tianren Ma
Lingxi Xie
Yunjie Tian
Boyu Yang
Yuan Zhang
David S. Doermann
Qixiang Ye
Published in:
CoRR (2024)
Keyphrases
</>
visual information
low level
visual cues
neural network
website
visual features
visual perception
high level
mobile robot
social networks
line segments
spatial relations
visual analysis
visual tasks