MMFT-BERT: Multimodal Fusion Transformer with BERT Encodings for Visual Question Answering.

Published in: EMNLP (Findings) (2020)

Keyphrases