Vision Transformer Interpretability via Prediction of Image Reflected Relevance Among Tokens.
Kento Sago, Kazuhiro Hotta
Published in: ICPRAM (2024)
Keyphrases
- input image
- image data
- prediction accuracy
- image retrieval
- image content
- single image
- image features
- multiscale
- image analysis
- image segmentation
- image classification
- image collections
- visual perception
- image regions
- image synthesis
- region of interest
- edge detection
- image representation
- low level image processing
- low level
- similarity measure
- real time
- segmentation method
- pixel values
- high resolution
- computer vision
- test images
- information retrieval
- color vision
- vector field
- image matching
- hough transform
- keypoints
- fault diagnosis
- segmentation algorithm
- feature points
- image quality
- fuzzy logic