Login / Signup
GSM-Plus: A Comprehensive Benchmark for Evaluating the Robustness of LLMs as Mathematical Problem Solvers.
Qintong Li
Leyang Cui
Xueliang Zhao
Lingpeng Kong
Wei Bi
Published in:
CoRR (2024)
Keyphrases
</>
computational efficiency
end to end
sat solving
mathematical expressions
database
search engine
evolutionary algorithm
sat solvers
distributed computing
quantified boolean formulas
computationally hard problems