StepCoder: Improving Code Generation with Reinforcement Learning from Compiler Feedback.
Shihan DouYan LiuHaoxiang JiaEnyu ZhouLimao XiongJunjie ShanCaishuang HuangXiao WangXiaoran FanZhiheng XiYuhao ZhouTao JiRui ZhengQi ZhangTao GuiXuanjing HuangPublished in: ACL (1) (2024)
Keyphrases