Entity-Aware Multimodal Alignment Framework for News Image Captioning.
Junzhe ZhangHuixuan ZhangXiaojun WanPublished in: CoRR (2024)
Keyphrases
- image features
- image alignment
- single image
- image retrieval
- image data
- input image
- image analysis
- similarity measure
- template matching
- image representation
- feature points
- bayesian framework
- region of interest
- image classification
- image content
- image segmentation
- segmentation algorithm
- spatial information
- image collections
- computer vision
- probabilistic model
- segmentation method
- image matching
- low level
- high resolution
- image pixels
- knowledge base