mChartQA: A universal benchmark for multimodal Chart Question Answer based on Vision-Language Alignment and Reasoning.
Jingxuan Wei
Nan Xu
Guiyong Chang
Yin Luo
Bihui Yu
Ruifeng Guo
Published in:
CoRR (2024)
Keyphrases
question answer
multi modal
computer vision
meta level
natural language
reasoning systems
real time
information retrieval
language learning
programming language
vision system
real world
knowledge representation
image processing
uml class diagrams
asked questions
knowledge base
machine learning