Attention as Grounding: Exploring Textual and Cross-Modal Attention on Entities and Relations in Language-and-Vision Transformer.
Nikolai Ilinykh
Simon Dobnik
Published in: ACL (Findings) (2022)
Keyphrases
cross-modal
multi-modal
co-occurrence
image classification
spatial relations