Multilevel Attention Networks and Policy Reinforcement Learning for Image Caption Generation.

Zhibo Zhou, Xiaoming Zhang, Zhoujun Li, Feiran Huang, Jie Xu

Published in: Big Data (2022)

Keyphrases

reinforcement learning
optimal policy
image classification
image data
template matching
image features
image analysis
low level
single image
segmentation method
input image
image retrieval
multiscale
image content
image segmentation
test images
image collections
visual features
similarity measure
state space
image regions
high resolution
video data
segmentation algorithm
region of interest
learning algorithm
image pixels
reward function
caption text