Unsupervised Grounding of Textual Descriptions of Object Features and Actions in Video.

Published in: KR (2016)

Keyphrases