Alert button

$\sqrt{n}$-Regret for Learning in Markov Decision Processes with Function Approximation and Low Bellman Rank

Sep 08, 2019
Kefan Dong, Jian Peng, Yining Wang, Yuan Zhou

Share this with someone who'll enjoy it:

View paper onarxiv icon

Share this with someone who'll enjoy it: