Channel Vision Transformers: An Image Is Worth 1 x 16 x 16 Words.
Yujia BaoSrinivasan SivanandanTheofanis KaraletsosPublished in: ICLR (2024)
Keyphrases
- image data
- input image
- template matching
- image content
- high resolution
- image representation
- image features
- image analysis
- image segmentation
- image classification
- test images
- real time
- image retrieval
- hough transform
- image regions
- single image
- visual perception
- image pixels
- multiscale
- image synthesis
- low level image processing
- pixel values
- segmentation algorithm
- auto annotation
- region of interest
- keypoints
- image registration
- low level
- keywords
- computer vision