Alert button

Exploration Bonus for Regret Minimization in Undiscounted Discrete and Continuous Markov Decision Processes

Dec 11, 2018
Jian Qian, Ronan Fruit, Matteo Pirotta, Alessandro Lazaric

Share this with someone who'll enjoy it:

View paper onarxiv icon

Share this with someone who'll enjoy it: