CropCap: Embedding Visual Cross-Partition Dependency for Image Captioning.
Bo Wang, Zhao Zhang, Suiyi Zhao, Haijun Zhang, Richang Hong, Meng Wang
Published in: ACM Multimedia (2023)
Keyphrases
- image data
- visual appearance
- input image
- multiscale
- visual perception
- image segmentation
- image features
- image content
- low level
- single image
- image classification
- image representation
- visually similar
- image retrieval
- edge detection
- visual attributes
- image regions
- image analysis
- human visual
- data embedding
- region of interest
- image collections
- watershed transformation
- image registration
- object recognition
- spatial relations
- test images
- visual information
- spatial information
- high resolution
- segmentation algorithm
- visual features