Alert button
Picture for David Silver

David Silver

Alert button

Mastering Atari, Go, Chess and Shogi by Planning with a Learned Model

Nov 19, 2019
Julian Schrittwieser, Ioannis Antonoglou, Thomas Hubert, Karen Simonyan, Laurent Sifre, Simon Schmitt, Arthur Guez, Edward Lockhart, Demis Hassabis, Thore Graepel, Timothy Lillicrap, David Silver

Figure 1 for Mastering Atari, Go, Chess and Shogi by Planning with a Learned Model
Figure 2 for Mastering Atari, Go, Chess and Shogi by Planning with a Learned Model
Figure 3 for Mastering Atari, Go, Chess and Shogi by Planning with a Learned Model
Figure 4 for Mastering Atari, Go, Chess and Shogi by Planning with a Learned Model
Viaarxiv icon

Discovery of Useful Questions as Auxiliary Tasks

Sep 10, 2019
Vivek Veeriah, Matteo Hessel, Zhongwen Xu, Richard Lewis, Janarthanan Rajendran, Junhyuk Oh, Hado van Hasselt, David Silver, Satinder Singh

Figure 1 for Discovery of Useful Questions as Auxiliary Tasks
Figure 2 for Discovery of Useful Questions as Auxiliary Tasks
Figure 3 for Discovery of Useful Questions as Auxiliary Tasks
Figure 4 for Discovery of Useful Questions as Auxiliary Tasks
Viaarxiv icon

Behaviour Suite for Reinforcement Learning

Aug 13, 2019
Ian Osband, Yotam Doron, Matteo Hessel, John Aslanides, Eren Sezener, Andre Saraiva, Katrina McKinney, Tor Lattimore, Csaba Szepezvari, Satinder Singh, Benjamin Van Roy, Richard Sutton, David Silver, Hado Van Hasselt

Figure 1 for Behaviour Suite for Reinforcement Learning
Figure 2 for Behaviour Suite for Reinforcement Learning
Figure 3 for Behaviour Suite for Reinforcement Learning
Figure 4 for Behaviour Suite for Reinforcement Learning
Viaarxiv icon

On Inductive Biases in Deep Reinforcement Learning

Jul 05, 2019
Matteo Hessel, Hado van Hasselt, Joseph Modayil, David Silver

Figure 1 for On Inductive Biases in Deep Reinforcement Learning
Figure 2 for On Inductive Biases in Deep Reinforcement Learning
Figure 3 for On Inductive Biases in Deep Reinforcement Learning
Figure 4 for On Inductive Biases in Deep Reinforcement Learning
Viaarxiv icon

Transfer in Deep Reinforcement Learning Using Successor Features and Generalised Policy Improvement

Jan 30, 2019
André Barreto, Diana Borsa, John Quan, Tom Schaul, David Silver, Matteo Hessel, Daniel Mankowitz, Augustin Žídek, Rémi Munos

Figure 1 for Transfer in Deep Reinforcement Learning Using Successor Features and Generalised Policy Improvement
Figure 2 for Transfer in Deep Reinforcement Learning Using Successor Features and Generalised Policy Improvement
Figure 3 for Transfer in Deep Reinforcement Learning Using Successor Features and Generalised Policy Improvement
Figure 4 for Transfer in Deep Reinforcement Learning Using Successor Features and Generalised Policy Improvement
Viaarxiv icon

An investigation of model-free planning

Jan 11, 2019
Arthur Guez, Mehdi Mirza, Karol Gregor, Rishabh Kabra, Sébastien Racanière, Théophane Weber, David Raposo, Adam Santoro, Laurent Orseau, Tom Eccles, Greg Wayne, David Silver, Timothy Lillicrap

Figure 1 for An investigation of model-free planning
Figure 2 for An investigation of model-free planning
Figure 3 for An investigation of model-free planning
Figure 4 for An investigation of model-free planning
Viaarxiv icon

Credit Assignment Techniques in Stochastic Computation Graphs

Jan 07, 2019
Théophane Weber, Nicolas Heess, Lars Buesing, David Silver

Figure 1 for Credit Assignment Techniques in Stochastic Computation Graphs
Figure 2 for Credit Assignment Techniques in Stochastic Computation Graphs
Figure 3 for Credit Assignment Techniques in Stochastic Computation Graphs
Figure 4 for Credit Assignment Techniques in Stochastic Computation Graphs
Viaarxiv icon

Universal Successor Features Approximators

Dec 18, 2018
Diana Borsa, André Barreto, John Quan, Daniel Mankowitz, Rémi Munos, Hado van Hasselt, David Silver, Tom Schaul

Figure 1 for Universal Successor Features Approximators
Figure 2 for Universal Successor Features Approximators
Figure 3 for Universal Successor Features Approximators
Figure 4 for Universal Successor Features Approximators
Viaarxiv icon

Bayesian Optimization in AlphaGo

Dec 17, 2018
Yutian Chen, Aja Huang, Ziyu Wang, Ioannis Antonoglou, Julian Schrittwieser, David Silver, Nando de Freitas

Figure 1 for Bayesian Optimization in AlphaGo
Figure 2 for Bayesian Optimization in AlphaGo
Figure 3 for Bayesian Optimization in AlphaGo
Figure 4 for Bayesian Optimization in AlphaGo
Viaarxiv icon