Aligning Images and Text with Semantic Role Labels for Fine-Grained Cross-Modal Understanding

Published in: LREC (2022)

Keyphrases