Improving Large Language Models via Fine-grained Reinforcement Learning with Minimum Editing Constraint

Jan 11, 2024
Zhipeng Chen, Kun Zhou, Wayne Xin Zhao, Junchen Wan, Fuzheng Zhang, Di Zhang, Ji-Rong Wen

View paper on arXiv