A Comparative Study on Deep CNN Visual Encoders for Image Captioning.
M. ArunS. ArivazhaganR. HarinisriP. S. RaghaviPublished in: CVIP (3) (2023)
Keyphrases
- input image
- low level
- image data
- image classification
- human visual
- image features
- image collections
- visual cues
- image analysis
- high resolution
- visual perception
- single image
- human observers
- visual appearance
- multiscale
- image segmentation
- edge detection
- visually similar
- web images
- visual attributes
- template matching
- image pixels
- test images
- image structure
- visual information
- image content
- image representation
- visual features
- image retrieval
- visual attention
- visual data
- region of interest
- image regions
- segmentation method
- visual input
- visual properties