Alert button

StepCoder: Improve Code Generation with Reinforcement Learning from Compiler Feedback

Feb 05, 2024
Shihan Dou, Yan Liu, Haoxiang Jia, Limao Xiong, Enyu Zhou, Wei Shen, Junjie Shan, Caishuang Huang, Xiao Wang, Xiaoran Fan, Zhiheng Xi, Yuhao Zhou, Tao Ji, Rui Zheng, Qi Zhang, Xuanjing Huang, Tao Gui

Share this with someone who'll enjoy it:

View paper onarxiv icon

Share this with someone who'll enjoy it: