Alert button
Picture for David Silver

David Silver

Alert button

Learning to Search with MCTSnets

Jul 17, 2018
Arthur Guez, Théophane Weber, Ioannis Antonoglou, Karen Simonyan, Oriol Vinyals, Daan Wierstra, Rémi Munos, David Silver

Figure 1 for Learning to Search with MCTSnets
Figure 2 for Learning to Search with MCTSnets
Figure 3 for Learning to Search with MCTSnets
Figure 4 for Learning to Search with MCTSnets
Viaarxiv icon

Human-level performance in first-person multiplayer games with population-based deep reinforcement learning

Jul 03, 2018
Max Jaderberg, Wojciech M. Czarnecki, Iain Dunning, Luke Marris, Guy Lever, Antonio Garcia Castaneda, Charles Beattie, Neil C. Rabinowitz, Ari S. Morcos, Avraham Ruderman, Nicolas Sonnerat, Tim Green, Louise Deason, Joel Z. Leibo, David Silver, Demis Hassabis, Koray Kavukcuoglu, Thore Graepel

Viaarxiv icon

Unicorn: Continual Learning with a Universal, Off-policy Agent

Jul 03, 2018
Daniel J. Mankowitz, Augustin Žídek, André Barreto, Dan Horgan, Matteo Hessel, John Quan, Junhyuk Oh, Hado van Hasselt, David Silver, Tom Schaul

Figure 1 for Unicorn: Continual Learning with a Universal, Off-policy Agent
Figure 2 for Unicorn: Continual Learning with a Universal, Off-policy Agent
Figure 3 for Unicorn: Continual Learning with a Universal, Off-policy Agent
Figure 4 for Unicorn: Continual Learning with a Universal, Off-policy Agent
Viaarxiv icon

Implicit Quantile Networks for Distributional Reinforcement Learning

Jun 14, 2018
Will Dabney, Georg Ostrovski, David Silver, Rémi Munos

Figure 1 for Implicit Quantile Networks for Distributional Reinforcement Learning
Figure 2 for Implicit Quantile Networks for Distributional Reinforcement Learning
Figure 3 for Implicit Quantile Networks for Distributional Reinforcement Learning
Figure 4 for Implicit Quantile Networks for Distributional Reinforcement Learning
Viaarxiv icon

Meta-Gradient Reinforcement Learning

May 24, 2018
Zhongwen Xu, Hado van Hasselt, David Silver

Figure 1 for Meta-Gradient Reinforcement Learning
Figure 2 for Meta-Gradient Reinforcement Learning
Figure 3 for Meta-Gradient Reinforcement Learning
Figure 4 for Meta-Gradient Reinforcement Learning
Viaarxiv icon

Successor Features for Transfer in Reinforcement Learning

Apr 12, 2018
André Barreto, Will Dabney, Rémi Munos, Jonathan J. Hunt, Tom Schaul, Hado van Hasselt, David Silver

Figure 1 for Successor Features for Transfer in Reinforcement Learning
Figure 2 for Successor Features for Transfer in Reinforcement Learning
Figure 3 for Successor Features for Transfer in Reinforcement Learning
Figure 4 for Successor Features for Transfer in Reinforcement Learning
Viaarxiv icon

Unsupervised Predictive Memory in a Goal-Directed Agent

Mar 28, 2018
Greg Wayne, Chia-Chun Hung, David Amos, Mehdi Mirza, Arun Ahuja, Agnieszka Grabska-Barwinska, Jack Rae, Piotr Mirowski, Joel Z. Leibo, Adam Santoro, Mevlana Gemici, Malcolm Reynolds, Tim Harley, Josh Abramson, Shakir Mohamed, Danilo Rezende, David Saxton, Adam Cain, Chloe Hillier, David Silver, Koray Kavukcuoglu, Matt Botvinick, Demis Hassabis, Timothy Lillicrap

Figure 1 for Unsupervised Predictive Memory in a Goal-Directed Agent
Figure 2 for Unsupervised Predictive Memory in a Goal-Directed Agent
Figure 3 for Unsupervised Predictive Memory in a Goal-Directed Agent
Figure 4 for Unsupervised Predictive Memory in a Goal-Directed Agent
Viaarxiv icon

Distributed Prioritized Experience Replay

Mar 02, 2018
Dan Horgan, John Quan, David Budden, Gabriel Barth-Maron, Matteo Hessel, Hado van Hasselt, David Silver

Figure 1 for Distributed Prioritized Experience Replay
Figure 2 for Distributed Prioritized Experience Replay
Figure 3 for Distributed Prioritized Experience Replay
Figure 4 for Distributed Prioritized Experience Replay
Viaarxiv icon

Imagination-Augmented Agents for Deep Reinforcement Learning

Feb 14, 2018
Théophane Weber, Sébastien Racanière, David P. Reichert, Lars Buesing, Arthur Guez, Danilo Jimenez Rezende, Adria Puigdomènech Badia, Oriol Vinyals, Nicolas Heess, Yujia Li, Razvan Pascanu, Peter Battaglia, Demis Hassabis, David Silver, Daan Wierstra

Figure 1 for Imagination-Augmented Agents for Deep Reinforcement Learning
Figure 2 for Imagination-Augmented Agents for Deep Reinforcement Learning
Figure 3 for Imagination-Augmented Agents for Deep Reinforcement Learning
Figure 4 for Imagination-Augmented Agents for Deep Reinforcement Learning
Viaarxiv icon

Mastering Chess and Shogi by Self-Play with a General Reinforcement Learning Algorithm

Dec 05, 2017
David Silver, Thomas Hubert, Julian Schrittwieser, Ioannis Antonoglou, Matthew Lai, Arthur Guez, Marc Lanctot, Laurent Sifre, Dharshan Kumaran, Thore Graepel, Timothy Lillicrap, Karen Simonyan, Demis Hassabis

Figure 1 for Mastering Chess and Shogi by Self-Play with a General Reinforcement Learning Algorithm
Figure 2 for Mastering Chess and Shogi by Self-Play with a General Reinforcement Learning Algorithm
Figure 3 for Mastering Chess and Shogi by Self-Play with a General Reinforcement Learning Algorithm
Figure 4 for Mastering Chess and Shogi by Self-Play with a General Reinforcement Learning Algorithm
Viaarxiv icon