Login / Signup
Masked Vision and Language Modeling for Multi-modal Representation Learning.
Gukyeong Kwon
Zhaowei Cai
Avinash Ravichandran
Erhan Bas
Rahul Bhotika
Stefano Soatto
Published in:
CoRR (2022)
Keyphrases
</>
multi modal
language modeling
learning process
language model
learning algorithm
query expansion
multi modality
high dimensional
low dimensional
image representation
bayesian networks
information retrieval
n gram
retrieval model
image processing
cross lingual
video search
cross modal