Human-in-the-loop: Provably Efficient Preference-based Reinforcement Learning with General Function Approximation

Add code
May 24, 2022

Share this with someone who'll enjoy it:

View paper onarxiv icon

Share this with someone who'll enjoy it: