Does my multimodal model learn cross-modal interactions? It's harder to tell than you might think!
Jack Hessel
Lillian Lee
Published in:
EMNLP (1) (2020)
Keyphrases
multi modal
cross modal
information retrieval
multimedia
data structure
video sequences
data points
image database
wordnet