Alert button

Human-in-the-loop: Provably Efficient Preference-based Reinforcement Learning with General Function Approximation

May 24, 2022
Xiaoyu Chen, Han Zhong, Zhuoran Yang, Zhaoran Wang, Liwei Wang

Share this with someone who'll enjoy it:

View paper onarxiv icon

Share this with someone who'll enjoy it: