Vision-and-Language or Vision-for-Language? On Cross-Modal Influence in Multimodal Transformers.

Stella Frank, Emanuele Bugliarello, Desmond Elliott
Published in: EMNLP (1) (2021)
Keyphrases
  • cross modal
  • multi modal
  • computer vision
  • natural language
  • document retrieval
  • visual information
  • visual recognition