Enhanced Chart Understanding in Vision and Language Task via Cross-modal Pre-training on Plot Table Pairs.
Mingyang Zhou
Yi R. Fung
Long Chen
Christopher Thomas
Heng Ji
Shih-Fu Chang
Published in:
CoRR (2023)
Keyphrases
cross modal
multi modal
visual similarity
computer vision
visual recognition
image retrieval
multimedia retrieval
database
natural language
training set
supervised learning
multimedia databases
image representation
visual data
perceptual information