CodeScope: An Execution-based Multilingual Multitask Multidimensional Benchmark for Evaluating LLMs on Code Understanding and Generation.
Weixiang YanHaitian LiuYunkun WangYunzhe LiQian ChenWen WangTingyu LinWeishan ZhaoLi ZhuShuiguang DengHari SundaramPublished in: CoRR (2023)
Keyphrases