Resolving Language and Vision Ambiguities Together: Joint Segmentation & Prepositional Attachment Resolution in Captioned Scenes.
Gordon A. Christie, Ankit Laddha, Aishwarya Agrawal, Stanislaw Antol, Yash Goyal, Kevin Kochersberger, Dhruv Batra. Published in: EMNLP (2016)