Does my multimodal model learn cross-modal interactions? It's harder to tell than you might think!

Published in: EMNLP (1) (2020)

Keyphrases