Alert button

Dense Reward for Free in Reinforcement Learning from Human Feedback

Feb 01, 2024
Alex J. Chan, Hao Sun, Samuel Holt, Mihaela van der Schaar

Share this with someone who'll enjoy it:

View paper onarxiv icon

Share this with someone who'll enjoy it: