Channel Vision Transformers: An Image Is Worth 1 x 16 x 16 Words.

Yujia Bao Srinivasan Sivanandan Theofanis Karaletsos

Published in: ICLR (2024)

Keyphrases

image data
input image
template matching
image content
high resolution
image representation
image features
image analysis
image segmentation
image classification
test images
real time
image retrieval
hough transform
image regions
single image
visual perception
image pixels
multiscale
image synthesis
low level image processing
pixel values
segmentation algorithm
auto annotation
region of interest
keypoints
image registration
low level
keywords
computer vision