Resolving vision and language ambiguities together: Joint segmentation & prepositional attachment resolution in captioned scenes.
Gordon A. ChristieAnkit LaddhaAishwarya AgrawalStanislaw AntolYash GoyalKevin KochersbergerDhruv BatraPublished in: Comput. Vis. Image Underst. (2017)