An Image is Worth More Than 16x16 Patches: Exploring Transformers on Individual Pixels.
Duy-Kien NguyenMahmoud AssranUnnat JainMartin R. OswaldCees G. M. SnoekXinlei ChenPublished in: CoRR (2024)
Keyphrases
- input image
- image pixels
- pixel values
- image data
- neighboring pixels
- image regions
- single image
- sample images
- grey level
- gray value
- pixel intensities
- grey levels
- homogeneous regions
- original images
- intensity values
- template matching
- multiscale
- image analysis
- image features
- adjacent regions
- high resolution
- edge detection
- image representation
- test images
- pixel wise
- adjacent pixels
- foreground and background
- image retrieval
- image content
- spatial information
- pixel classification
- hough transform
- gray level images
- feature points
- natural images
- gray level
- image classification
- edge pixels
- color components
- object recognition
- background image
- intensity difference
- color features
- image segmentation
- probe image