Login / Signup
Scalable and Accurate Self-supervised Multimodal Representation Learning without Aligned Video and Text Data.
Vladislav Lialin
Stephen Rawls
David Chan
Shalini Ghosh
Anna Rumshisky
Wael Hamza
Published in:
WACV (Workshops) (2023)
Keyphrases
</>
text data
learning process
prior knowledge
text mining
machine learning
multimedia
digital libraries
natural language processing
text classification
databases
learning algorithm
computer vision
image processing
database systems
high dimensional
image representation