Login / Signup
X-LXMERT: Paint, Caption and Answer Questions with Multi-Modal Transformers.
Jaemin Cho
Jiasen Lu
Dustin Schwenk
Hannaneh Hajishirzi
Aniruddha Kembhavi
Published in:
EMNLP (1) (2020)
Keyphrases
</>
multi modal
answer questions
visual features
multi modality
audio visual
cross modal
high dimensional
video retrieval
video search
computer vision
image processing
uni modal