Unsupervised Visual-Linguistic Reference Resolution in Instructional Videos.
De-An HuangJoseph J. LimLi Fei-FeiJuan Carlos NieblesPublished in: CoRR (2017)
Keyphrases
- reference resolution
- referring expressions
- instructional videos
- natural language processing
- relation extraction
- natural language text
- content analysis
- machine learning
- visual information
- coreference resolution
- unsupervised learning
- visual features
- named entity recognition
- supervised learning
- domain specific
- information extraction
- natural language
- high level
- multimedia
- learning algorithm