Multilevel Attention Networks and Policy Reinforcement Learning for Image Caption Generation.
Zhibo ZhouXiaoming ZhangZhoujun LiFeiran HuangJie XuPublished in: Big Data (2022)
Keyphrases
- reinforcement learning
- optimal policy
- image classification
- image data
- template matching
- image features
- image analysis
- low level
- single image
- segmentation method
- input image
- image retrieval
- multiscale
- image content
- image segmentation
- test images
- image collections
- visual features
- similarity measure
- state space
- image regions
- high resolution
- video data
- segmentation algorithm
- region of interest
- learning algorithm
- image pixels
- reward function
- caption text