Unsupervised Grounding of Textual Descriptions of Object Features and Actions in Video.
Muhannad Al-Omari
Eris Chinellato
Yiannis Gatsoulis
David C. Hogg
Anthony G. Cohn
Published in:
KR (2016)
Keyphrases
textual descriptions
object features
metadata
semantic representation
semantic concepts
web images
3D objects
semantic information
image features
multimedia
keywords
video data
multimodal
semi-supervised
object recognition
video sequences
feature extraction
visual features
machine learning
natural language processing
video frames
multimedia content
object representation
visual scene
search engine