Login / Signup
Draw-and-Understand: Leveraging Visual Prompts to Enable MLLMs to Comprehend What You Want.
Weifeng Lin
Xinyu Wei
Ruichuan An
Peng Gao
Bocheng Zou
Yulin Luo
Siyuan Huang
Shanghang Zhang
Hongsheng Li
Published in:
CoRR (2024)
Keyphrases
</>
visual information
low level
fully understand
visual features
digital libraries
data structure
data sets
decision making
learning environment
computer vision
machine learning
databases
visual cues
human vision
deeper understanding
visual representation
visual properties
real time