Picture for Ian Osband

Ian Osband

Tony

On Lower Bounds for Regret in Reinforcement Learning

Add code
Aug 09, 2016
Figure 1 for On Lower Bounds for Regret in Reinforcement Learning
Viaarxiv icon

Posterior Sampling for Reinforcement Learning Without Episodes

Add code
Aug 09, 2016
Viaarxiv icon

Deep Exploration via Bootstrapped DQN

Add code
Jul 04, 2016
Figure 1 for Deep Exploration via Bootstrapped DQN
Figure 2 for Deep Exploration via Bootstrapped DQN
Figure 3 for Deep Exploration via Bootstrapped DQN
Figure 4 for Deep Exploration via Bootstrapped DQN
Viaarxiv icon

Generalization and Exploration via Randomized Value Functions

Add code
Feb 15, 2016
Figure 1 for Generalization and Exploration via Randomized Value Functions
Figure 2 for Generalization and Exploration via Randomized Value Functions
Figure 3 for Generalization and Exploration via Randomized Value Functions
Figure 4 for Generalization and Exploration via Randomized Value Functions
Viaarxiv icon

Bootstrapped Thompson Sampling and Deep Exploration

Add code
Jul 01, 2015
Figure 1 for Bootstrapped Thompson Sampling and Deep Exploration
Viaarxiv icon

Model-based Reinforcement Learning and the Eluder Dimension

Add code
Oct 31, 2014
Viaarxiv icon

Near-optimal Reinforcement Learning in Factored MDPs

Add code
Oct 31, 2014
Viaarxiv icon

(More) Efficient Reinforcement Learning via Posterior Sampling

Add code
Dec 26, 2013
Figure 1 for (More) Efficient Reinforcement Learning via Posterior Sampling
Figure 2 for (More) Efficient Reinforcement Learning via Posterior Sampling
Figure 3 for (More) Efficient Reinforcement Learning via Posterior Sampling
Viaarxiv icon