Exploring the Grounding Issues in Image Caption.
Pin-Er ChenHsin-Yu ChouPo-Ya Angela WangYu-Hsiang TsengShu-Kai HsiehPublished in: CoRR (2023)
Keyphrases
- input image
- image analysis
- high resolution
- image data
- image features
- template matching
- multiscale
- single image
- image content
- image classification
- image retrieval
- image collections
- test images
- image representation
- image segmentation
- image structure
- vector field
- energy function
- spatial information
- edge detection
- low level
- keypoints
- image matching
- segmentation algorithm
- feature points
- image pixels
- grey level
- caption text
- image regions
- segmentation method
- region of interest
- relevance feedback
- video retrieval
- optical flow
- bounding box