Ian Osband

Hypermodels for Exploration

Jun 12, 2020
Vikranth Dwaracherla, Xiuyuan Lu, Morteza Ibrahimi, Ian Osband, Zheng Wen, Benjamin Van Roy

Stochastic matrix games with bandit feedback

Jun 09, 2020
Brendan O'Donoghue, Tor Lattimore, Ian Osband

Making Sense of Reinforcement Learning and Probabilistic Inference

Feb 14, 2020
Brendan O'Donoghue, Ian Osband, Catalin Ionescu

Behaviour Suite for Reinforcement Learning

Aug 13, 2019
Ian Osband, Yotam Doron, Matteo Hessel, John Aslanides, Eren Sezener, Andre Saraiva, Katrina McKinney, Tor Lattimore, Csaba Szepesvári, Satinder Singh, Benjamin Van Roy, Richard Sutton, David Silver, Hado van Hasselt

Meta-learning of Sequential Strategies

May 08, 2019
Pedro A. Ortega, Jane X. Wang, Mark Rowland, Tim Genewein, Zeb Kurth-Nelson, Razvan Pascanu, Nicolas Heess, Joel Veness, Alex Pritzel, Pablo Sprechmann, Siddhant M. Jayakumar, Tom McGrath, Kevin Miller, Mohammad Azar, Ian Osband, Neil Rabinowitz, András György, Silvia Chiappa, Simon Osindero, Yee Whye Teh, Hado van Hasselt, Nando de Freitas, Matthew Botvinick, Shane Legg

The Uncertainty Bellman Equation and Exploration

Oct 22, 2018
Brendan O'Donoghue, Ian Osband, Remi Munos, Volodymyr Mnih

Randomized Prior Functions for Deep Reinforcement Learning

Jun 08, 2018
Ian Osband, John Aslanides, Albin Cassirer

Deep Exploration via Randomized Value Functions

Jun 06, 2018
Ian Osband, Benjamin Van Roy, Daniel Russo, Zheng Wen
