Alert button

Sample Efficient Reinforcement Learning from Human Feedback via Active Exploration

Dec 01, 2023
Viraj Mehta, Vikramjeet Das, Ojash Neopane, Yijia Dai, Ilija Bogunovic, Jeff Schneider, Willie Neiswanger

Share this with someone who'll enjoy it:

View paper onarxiv icon

Share this with someone who'll enjoy it: