Deep Vision Transformer and T5-Based for Image Captioning.
Khang Nhut LamHuy Thanh NguyenVinh Phuoc MaiJugal KalitaPublished in: RIVF (2023)
Keyphrases
- image data
- input image
- image features
- image classification
- multiscale
- single image
- image analysis
- template matching
- image content
- high resolution
- image segmentation
- test images
- hough transform
- region of interest
- vision system
- image collections
- pixel values
- image synthesis
- grey level
- computer vision
- image representation
- feature points
- level set
- image processing
- image retrieval
- low level image processing
- low level vision
- real time
- visual perception
- color constancy
- image pixels
- video sequences
- fault diagnosis
- image set
- low level
- spatial information
- segmentation method