Login / Signup
CIBench: Evaluating Your LLMs with a Code Interpreter Plugin.
Songyang Zhang
Chuyu Zhang
Yingfan Hu
Haowen Shen
Kuikun Liu
Zerun Ma
Fengzhe Zhou
Wenwei Zhang
Xuming He
Dahua Lin
Kai Chen
Published in:
CoRR (2024)
Keyphrases
</>
artificial intelligence
source code
programming language
database
error handling
data mining
java virtual machine
databases
neural network
computer vision
knowledge base
case study
multi agent
production system
error correction
error detection