MARIO Eval: Evaluate Your Math LLM with your Math LLM--A mathematical dataset evaluation toolkit.
Boning Zhang
Chengxi Li
Kai Fan
Published in:
CoRR (2024)
Keyphrases
mathematical problem solving
tutoring system
evaluation model
evaluation method
intelligent tutors
data sets
real world
artificial intelligence
evaluation metrics
mathematical models
gold standard
synthetic datasets
mathematics learning