Learning to Collocate Visual-Linguistic Neural Modules for Image Captioning.
Xu YangHanwang ZhangChongyang GaoJianfei CaiPublished in: Int. J. Comput. Vis. (2023)
Keyphrases
- image data
- single image
- input image
- learning algorithm
- multiscale
- supervised learning
- visual perception
- image classification
- learning process
- low level
- image analysis
- image features
- neural network
- image segmentation
- image content
- test images
- pixel values
- image retrieval
- edge detection
- high level
- visual cues
- visual processing
- computer vision
- spatial relations
- learning tasks
- visual information
- image regions