AQA-Bench: An Interactive Benchmark for Evaluating LLMs' Sequential Reasoning Ability.
Siwei YangBingchen ZhaoCihang XiePublished in: CoRR (2024)
Keyphrases
- cognitive abilities
- reasoning systems
- knowledge representation
- reasoning process
- probabilistic reasoning
- computer graphics
- database
- real world
- knowledge base
- user friendly
- automated reasoning
- computer algebra systems
- interactive systems
- graphical interface
- data visualization
- data analysis
- e learning
- artificial intelligence
- neural network
- real time