Vision-Flan: Scaling Human-Labeled Tasks in Visual Instruction Tuning.
Zhiyang Xu, Chao Feng, Rulin Shao, Trevor Ashby, Ying Shen, Di Jin, Yu Cheng, Qifan Wang, Lifu Huang
Published in: ACL (Findings) (2024)
Keyphrases
- visual processing
- visual tasks
- visual perception
- visual information
- visually guided
- training data
- human users
- human visual
- visual features
- supervised learning
- computer vision
- visual field
- visual attributes
- robotic systems
- human vision
- real time
- visuo motor
- active vision
- human observers
- human behavior
- low level
- cognitive model
- action recognition
- high level
- color perception
- multimedia
- eye movement patterns
- data sets