Alert button

ReFT: Reasoning with Reinforced Fine-Tuning

Jan 17, 2024
Trung Quoc Luong, Xinbo Zhang, Zhanming Jie, Peng Sun, Xiaoran Jin, Hang Li

Share this with someone who'll enjoy it:

View paper onarxiv icon

Share this with someone who'll enjoy it: