Synthesizing Spoken Descriptions of Images.
Xinsheng WangJustin van der HoutJihua ZhuMark Hasegawa-JohnsonOdette ScharenborgPublished in: IEEE ACM Trans. Audio Speech Lang. Process. (2021)
Keyphrases
- image data
- image database
- three dimensional
- ground truth
- image classification
- input image
- multiple images
- rigid body
- image registration
- image analysis
- original images
- image features
- test images
- image set
- similarity measure
- image pixels
- image retrieval
- object recognition
- natural images
- computer graphics
- image description
- image collections
- image processing algorithms
- spoken language
- pixel values
- image structure
- image annotation
- image matching
- image regions
- edge detection