Alert button
Picture for David Silver

David Silver

Alert button

A Unified Game-Theoretic Approach to Multiagent Reinforcement Learning

Nov 07, 2017
Marc Lanctot, Vinicius Zambaldi, Audrunas Gruslys, Angeliki Lazaridou, Karl Tuyls, Julien Perolat, David Silver, Thore Graepel

Figure 1 for A Unified Game-Theoretic Approach to Multiagent Reinforcement Learning
Figure 2 for A Unified Game-Theoretic Approach to Multiagent Reinforcement Learning
Figure 3 for A Unified Game-Theoretic Approach to Multiagent Reinforcement Learning
Figure 4 for A Unified Game-Theoretic Approach to Multiagent Reinforcement Learning
Viaarxiv icon

Rainbow: Combining Improvements in Deep Reinforcement Learning

Oct 06, 2017
Matteo Hessel, Joseph Modayil, Hado van Hasselt, Tom Schaul, Georg Ostrovski, Will Dabney, Dan Horgan, Bilal Piot, Mohammad Azar, David Silver

Figure 1 for Rainbow: Combining Improvements in Deep Reinforcement Learning
Figure 2 for Rainbow: Combining Improvements in Deep Reinforcement Learning
Figure 3 for Rainbow: Combining Improvements in Deep Reinforcement Learning
Figure 4 for Rainbow: Combining Improvements in Deep Reinforcement Learning
Viaarxiv icon

StarCraft II: A New Challenge for Reinforcement Learning

Aug 16, 2017
Oriol Vinyals, Timo Ewalds, Sergey Bartunov, Petko Georgiev, Alexander Sasha Vezhnevets, Michelle Yeo, Alireza Makhzani, Heinrich Küttler, John Agapiou, Julian Schrittwieser, John Quan, Stephen Gaffney, Stig Petersen, Karen Simonyan, Tom Schaul, Hado van Hasselt, David Silver, Timothy Lillicrap, Kevin Calderone, Paul Keet, Anthony Brunasso, David Lawrence, Anders Ekermo, Jacob Repp, Rodney Tsing

Figure 1 for StarCraft II: A New Challenge for Reinforcement Learning
Figure 2 for StarCraft II: A New Challenge for Reinforcement Learning
Figure 3 for StarCraft II: A New Challenge for Reinforcement Learning
Figure 4 for StarCraft II: A New Challenge for Reinforcement Learning
Viaarxiv icon

The Predictron: End-To-End Learning and Planning

Jul 20, 2017
David Silver, Hado van Hasselt, Matteo Hessel, Tom Schaul, Arthur Guez, Tim Harley, Gabriel Dulac-Arnold, David Reichert, Neil Rabinowitz, Andre Barreto, Thomas Degris

Figure 1 for The Predictron: End-To-End Learning and Planning
Figure 2 for The Predictron: End-To-End Learning and Planning
Figure 3 for The Predictron: End-To-End Learning and Planning
Figure 4 for The Predictron: End-To-End Learning and Planning
Viaarxiv icon

Emergence of Locomotion Behaviours in Rich Environments

Jul 10, 2017
Nicolas Heess, Dhruva TB, Srinivasan Sriram, Jay Lemmon, Josh Merel, Greg Wayne, Yuval Tassa, Tom Erez, Ziyu Wang, S. M. Ali Eslami, Martin Riedmiller, David Silver

Figure 1 for Emergence of Locomotion Behaviours in Rich Environments
Figure 2 for Emergence of Locomotion Behaviours in Rich Environments
Figure 3 for Emergence of Locomotion Behaviours in Rich Environments
Figure 4 for Emergence of Locomotion Behaviours in Rich Environments
Viaarxiv icon

Decoupled Neural Interfaces using Synthetic Gradients

Jul 03, 2017
Max Jaderberg, Wojciech Marian Czarnecki, Simon Osindero, Oriol Vinyals, Alex Graves, David Silver, Koray Kavukcuoglu

Figure 1 for Decoupled Neural Interfaces using Synthetic Gradients
Figure 2 for Decoupled Neural Interfaces using Synthetic Gradients
Figure 3 for Decoupled Neural Interfaces using Synthetic Gradients
Figure 4 for Decoupled Neural Interfaces using Synthetic Gradients
Viaarxiv icon

FeUdal Networks for Hierarchical Reinforcement Learning

Mar 06, 2017
Alexander Sasha Vezhnevets, Simon Osindero, Tom Schaul, Nicolas Heess, Max Jaderberg, David Silver, Koray Kavukcuoglu

Figure 1 for FeUdal Networks for Hierarchical Reinforcement Learning
Figure 2 for FeUdal Networks for Hierarchical Reinforcement Learning
Figure 3 for FeUdal Networks for Hierarchical Reinforcement Learning
Figure 4 for FeUdal Networks for Hierarchical Reinforcement Learning
Viaarxiv icon

Reinforcement Learning with Unsupervised Auxiliary Tasks

Nov 16, 2016
Max Jaderberg, Volodymyr Mnih, Wojciech Marian Czarnecki, Tom Schaul, Joel Z Leibo, David Silver, Koray Kavukcuoglu

Figure 1 for Reinforcement Learning with Unsupervised Auxiliary Tasks
Figure 2 for Reinforcement Learning with Unsupervised Auxiliary Tasks
Figure 3 for Reinforcement Learning with Unsupervised Auxiliary Tasks
Figure 4 for Reinforcement Learning with Unsupervised Auxiliary Tasks
Viaarxiv icon

Learning and Transfer of Modulated Locomotor Controllers

Oct 17, 2016
Nicolas Heess, Greg Wayne, Yuval Tassa, Timothy Lillicrap, Martin Riedmiller, David Silver

Figure 1 for Learning and Transfer of Modulated Locomotor Controllers
Figure 2 for Learning and Transfer of Modulated Locomotor Controllers
Figure 3 for Learning and Transfer of Modulated Locomotor Controllers
Figure 4 for Learning and Transfer of Modulated Locomotor Controllers
Viaarxiv icon

Learning values across many orders of magnitude

Aug 16, 2016
Hado van Hasselt, Arthur Guez, Matteo Hessel, Volodymyr Mnih, David Silver

Figure 1 for Learning values across many orders of magnitude
Figure 2 for Learning values across many orders of magnitude
Figure 3 for Learning values across many orders of magnitude
Viaarxiv icon