Unsupervised Learning of Spoken Language with Visual Context.
David F. HarwathAntonio TorralbaJames R. GlassPublished in: NIPS (2016)
Keyphrases
- spoken language
- visual context
- unsupervised learning
- temporal context
- scene interpretation
- object detection
- dialogue system
- supervised learning
- language processing
- semantic context
- semantic analysis
- object recognition
- semi supervised
- machine learning
- visual scene
- visual words
- expectation maximization
- dimensionality reduction
- text classification
- temporal information
- video annotation
- domain ontology
- wordnet
- search engine