Sign in

MMFT-BERT: Multimodal Fusion Transformer with BERT Encodings for Visual Question Answering.

Aisha Urooj KhanAmir MazaheriNiels da Vitoria LoboMubarak Shah
Published in: EMNLP (Findings) (2020)
Keyphrases