VLCDoC: Vision-Language Contrastive Pre-Training Model for Cross-Modal Document Classification.

Published in: Pattern Recognit. (2023)

Keyphrases