Stacked cross-modal feature consolidation attention networks for image captioning.
Mozhgan Pourkeshavarz
Shahabedin Nabavi
Mohsen Ebrahimi Moghaddam
Mehrnoush Shamsfard
Published in:
Multim. Tools Appl. (2024)
Keyphrases
image features
cross modal
image data
image content
input image
image retrieval
image segmentation
image classification
multiscale
image representation
image regions
image collections
keypoints
test images
multi modal
spatial relationships