Get our free extension to see links to code for papers anywhere online!

Chrome logo Add to Chrome

Firefox logo Add to Firefox


Deep Exploration via Randomized Value Functions

Jun 06, 2018
Ian Osband, Benjamin Van Roy, Daniel Russo, Zheng Wen



We study the use of randomized value functions to guide deep exploration in reinforcement learning. This offers an elegant means for synthesizing statistically and computationally efficient exploration with common practical approaches to value function learning. We present several reinforcement learning algorithms that leverage randomized value functions and demonstrate their efficacy through computational studies. We also prove a regret bound that establishes statistical efficiency with a tabular representation.



Share this with someone who'll enjoy it:

   Access Paper Source



Share this with someone who'll enjoy it: