Integrating Language and Vision to Generate Natural Language Descriptions of Videos in the Wild.
Jesse ThomasonSubhashini VenugopalanSergio GuadarramaKate SaenkoRaymond J. MooneyPublished in: COLING (2014)
Keyphrases
- natural language descriptions
- natural language
- vision system
- programming language
- computer vision
- real time
- conceptual graphs
- natural language processing
- language processing
- machine learning
- image processing
- video sequences
- spatio temporal
- space time
- machine translation
- video frames
- language learning
- visual perception
- human vision