VLCDoC: Vision-Language Contrastive Pre-Training Model for Cross-Modal Document Classification.

Published in: CoRR (2022)

Keyphrases