Multimodal One-Shot Learning of Speech and Images.
Ryan Eloff, Herman A. Engelbrecht, Herman Kamper
Published in: CoRR (2018)
Keyphrases
- image database
- image data
- input image
- three dimensional
- image retrieval
- audio signals
- image analysis
- rigid body
- image features
- image registration
- audio visual
- image classification
- test images
- multiple images
- multi modal
- object recognition
- ground truth
- edge detection
- visual data
- image collections
- multimodal image registration
- feature points
- automatic speech recognition
- small number
- pixel values
- image annotation
- image set
- lighting conditions
- image quality
- image regions
- computer graphics
- image processing