Does my multimodal model learn cross-modal interactions? It's harder to tell than you might think!

Jack Hessel, Lillian Lee
Published in: EMNLP (1) (2020)
Keyphrases
  • multi modal
  • cross modal
  • information retrieval
  • multimedia
  • data structure
  • video sequences
  • data points
  • image database
  • wordnet