RL4F: Generating Natural Language Feedback with Reinforcement Learning for Repairing Model Outputs

May 15, 2023
Afra Feyza Akyürek, Ekin Akyürek, Aman Madaan, Ashwin Kalyan, Peter Clark, Derry Wijaya, Niket Tandon

