Login / Signup
CRUXEval: A Benchmark for Code Reasoning, Understanding and Execution.
Alex Gu
Baptiste Rozière
Hugh Leather
Armando Solar-Lezama
Gabriel Synnaeve
Sida I. Wang
Published in:
CoRR (2024)
Keyphrases
</>
plan execution
source code
data flow
knowledge base
qualitative reasoning
database
comparative analysis
reasoning process
reasoning systems
code generation
knowledge representation
fuzzy logic
knowledge structures
spatial reasoning
control flow
model based reasoning