Login / Signup
ImageBERT: Cross-modal Pre-training with Large-scale Weak-supervised Image-Text Data.
Di Qi
Lin Su
Jia Song
Edward Cui
Taroon Bharti
Arun Sacheti
Published in:
CoRR (2020)
Keyphrases
</>
text data
cross modal
image retrieval
image data
input image
visual data
image features
image classification
multiscale
databases
supervised learning
visual similarity
multi modal
text classification
text mining
image collections
training set
data mining
face recognition