Login / Signup
Vision-and-Language or Vision-for-Language? On Cross-Modal Influence in Multimodal Transformers.
Stella Frank
Emanuele Bugliarello
Desmond Elliott
Published in:
EMNLP (1) (2021)
Keyphrases
</>
cross modal
multi modal
computer vision
natural language
document retrieval
visual information
visual recognition