Sequential image encoding for vision-to-language problems.
Jicheng Wang, Yuanen Zhou, Zhenzhen Hu, Xu Zhang, Meng Wang
Published in: Multim. Tools Appl. (2021)
Keyphrases
- input image
- image features
- image classification
- multiscale
- image data
- single image
- segmentation method
- image content
- image analysis
- template matching
- image collections
- image segmentation
- image synthesis
- image retrieval
- vision system
- low level image processing
- real time
- computer vision
- pixel values
- language learning
- test images
- hough transform
- image regions
- programming language
- high resolution
- energy function
- segmentation algorithm
- spatial information
- feature points
- edge detection
- image database
- image set
- image registration
- low level
- block coding
- natural language