Exploiting Pseudo Image Captions for Multimodal Summarization.

Chaoya Jiang Rui Xie Wei Ye Jinan Sun Shikun Zhang

Published in: ACL (Findings) (2023)

Keyphrases

image classification
input image
image analysis
image data
single image
image content
image features
image retrieval
multi modal
template matching
image segmentation
feature points
image representation
low level
high resolution
edge detection
image collections
region of interest
visual features
spatial information
hough transform
segmentation method
multiscale
computer vision
lighting conditions
image set
visual content
image pixels
multimodal image registration