Login / Signup
Masked Vision and Language Modeling for Multi-modal Representation Learning.
Gukyeong Kwon
Zhaowei Cai
Avinash Ravichandran
Erhan Bas
Rahul Bhotika
Stefano Soatto
Published in:
ICLR (2023)
Keyphrases
</>
multi modal
language modeling
query expansion
information retrieval
active learning
language model
image annotation
multi modality
retrieval model
audio visual
feature extraction
high dimensional