High-Quality Image Captioning With Fine-Grained and Semantic-Guided Visual Attention.
Zongjian ZhangQiang WuYang WangFang ChenPublished in: IEEE Trans. Multim. (2019)
Keyphrases
- fine grained
- visual attention
- high quality
- coarse grained
- visual perception
- salient regions
- image data
- input image
- visual attention model
- saliency map
- attention mechanism
- eye tracking
- high resolution
- access control
- image content
- visual saliency detection
- image features
- image classification
- multiscale
- biological vision systems
- image retrieval
- image representation
- visual search
- image segmentation
- eye movements
- image collections
- higher level
- image quality
- vision system
- region of interest
- natural scenes
- visual motion
- stereoscopic images
- eye fixations
- biologically inspired
- salient features
- image regions
- video data
- human computer interaction
- co occurrence
- high level