Login / Signup

GSM-Plus: A Comprehensive Benchmark for Evaluating the Robustness of LLMs as Mathematical Problem Solvers.

Qintong LiLeyang CuiXueliang ZhaoLingpeng KongWei Bi
Published in: CoRR (2024)
Keyphrases
  • computational efficiency
  • end to end
  • sat solving
  • mathematical expressions
  • database
  • search engine
  • evolutionary algorithm
  • sat solvers
  • distributed computing
  • quantified boolean formulas
  • computationally hard problems