Vision-Flan: Scaling Human-Labeled Tasks in Visual Instruction Tuning.
Zhiyang Xu, Chao Feng, Rulin Shao, Trevor Ashby, Ying Shen, Di Jin, Yu Cheng, Qifan Wang, Lifu Huang
Published in: CoRR (2024)
Keyphrases
- visual processing
- visual perception
- human vision
- visual tasks
- visual attributes
- human visual
- computer vision
- training data
- high level
- vision system
- visual field
- human subjects
- active vision
- visually guided
- human observers
- color perception
- visuo motor
- real time
- visual information
- human users
- human experts
- text classification
- supervised learning
- perceptual information
- low level
- multimedia
- image processing