Transformer-Exclusive Cross-Modal Representation for Vision and Language.

Andrew Shin, Takuya Narihira
Published in: ACL/IJCNLP (Findings) (2021)
Keyphrases
  • natural language processing
  • cross modal
  • natural language
  • perceptual information
  • multi modal
  • computer vision
  • visual recognition
  • multimedia databases