BigCodeBench: Benchmarking Code Generation with Diverse Function Calls and Complex Instructions.
Terry Yue Zhuo, Minh Chien Vu, Jenny Chim, Han Hu, Wenhao Yu, Ratnadira Widyasari, Imam Nur Bani Yusuf, Haolan Zhan, Junda He, Indraneil Paul, Simon Brunner, Chen Gong, Thong Hoang, Armel Randy Zebaze, Xiaoheng Hong, Wen-Ding Li, Jean Kaddour, Ming Xu, Zhihan Zhang, Prateek Yadav, Naman Jain, Alex Gu, Zhoujun Cheng, Jiawei Liu, Qian Liu, Zijian Wang, David Lo, Binyuan Hui, Niklas Muennighoff, Daniel Fried, Xiaoning Du, Harm de Vries, Leandro von Werra
Published in: CoRR (2024)