Image Captioners Are Scalable Vision Learners Too.
Michael TschannenManoj KumarAndreas SteinerXiaohua ZhaiNeil HoulsbyLucas BeyerPublished in: CoRR (2023)
Keyphrases
- single image
- input image
- image data
- image content
- image analysis
- image segmentation
- multiscale
- image features
- image classification
- low level vision
- segmentation method
- image collections
- real time
- image representation
- template matching
- vision system
- edge detection
- image retrieval
- pixel values
- image synthesis
- image set
- image pixels
- collaborative learning
- image regions
- keypoints
- low level
- grey level
- learning styles
- image structure
- low level image processing
- lighting conditions
- region of interest
- vector field
- learning activities
- digital images
- high resolution
- image processing