Alert button

On Reward-Free RL with Kernel and Neural Function Approximations: Single-Agent MDP and Markov Game

Oct 19, 2021
Shuang Qiu, Jieping Ye, Zhaoran Wang, Zhuoran Yang

Share this with someone who'll enjoy it:

View paper onarxiv icon

Share this with someone who'll enjoy it: