Login / Signup
VLCDoC: Vision-Language Contrastive Pre-Training Model for Cross-Modal Document Classification.
Souhail Bakkali
Zuheng Ming
Mickaël Coustaty
Marçal Rusiñol
Oriol Ramos Terrades
Published in:
CoRR (2022)
Keyphrases
</>
document classification
cross modal
high level
probabilistic model
decision trees
prior knowledge
data sets
training set
pairwise
high dimensional
text mining
information retrieval systems
multi modal
text categorization
classification algorithm