Self-Tuning Deep Reinforcement Learning

Mar 02, 2020
Tom Zahavy, Zhongwen Xu, Vivek Veeriah, Matteo Hessel, Junhyuk Oh, Hado van Hasselt, David Silver, Satinder Singh


  Access Model/Code and Paper
Value-driven Hindsight Modelling

Feb 19, 2020
Arthur Guez, Fabio Viola, Théophane Weber, Lars Buesing, Steven Kapturowski, Doina Precup, David Silver, Nicolas Heess

* 8 pages + reference + appendix 

  Access Model/Code and Paper
What Can Learned Intrinsic Rewards Capture?

Dec 11, 2019
Zeyu Zheng, Junhyuk Oh, Matteo Hessel, Zhongwen Xu, Manuel Kroiss, Hado van Hasselt, David Silver, Satinder Singh


  Access Model/Code and Paper
Mastering Atari, Go, Chess and Shogi by Planning with a Learned Model

Nov 19, 2019
Julian Schrittwieser, Ioannis Antonoglou, Thomas Hubert, Karen Simonyan, Laurent Sifre, Simon Schmitt, Arthur Guez, Edward Lockhart, Demis Hassabis, Thore Graepel, Timothy Lillicrap, David Silver


  Access Model/Code and Paper
Discovery of Useful Questions as Auxiliary Tasks

Sep 10, 2019
Vivek Veeriah, Matteo Hessel, Zhongwen Xu, Richard Lewis, Janarthanan Rajendran, Junhyuk Oh, Hado van Hasselt, David Silver, Satinder Singh


  Access Model/Code and Paper
Behaviour Suite for Reinforcement Learning

Aug 13, 2019
Ian Osband, Yotam Doron, Matteo Hessel, John Aslanides, Eren Sezener, Andre Saraiva, Katrina McKinney, Tor Lattimore, Csaba Szepezvari, Satinder Singh, Benjamin Van Roy, Richard Sutton, David Silver, Hado Van Hasselt


  Access Model/Code and Paper
On Inductive Biases in Deep Reinforcement Learning

Jul 05, 2019
Matteo Hessel, Hado van Hasselt, Joseph Modayil, David Silver


  Access Model/Code and Paper
Transfer in Deep Reinforcement Learning Using Successor Features and Generalised Policy Improvement

Jan 30, 2019
André Barreto, Diana Borsa, John Quan, Tom Schaul, David Silver, Matteo Hessel, Daniel Mankowitz, Augustin Žídek, Rémi Munos

* Published at ICML 2018 

  Access Model/Code and Paper
An investigation of model-free planning

Jan 11, 2019
Arthur Guez, Mehdi Mirza, Karol Gregor, Rishabh Kabra, Sébastien Racanière, Théophane Weber, David Raposo, Adam Santoro, Laurent Orseau, Tom Eccles, Greg Wayne, David Silver, Timothy Lillicrap


  Access Model/Code and Paper
Credit Assignment Techniques in Stochastic Computation Graphs

Jan 07, 2019
Théophane Weber, Nicolas Heess, Lars Buesing, David Silver


  Access Model/Code and Paper
Universal Successor Features Approximators

Dec 18, 2018
Diana Borsa, André Barreto, John Quan, Daniel Mankowitz, Rémi Munos, Hado van Hasselt, David Silver, Tom Schaul


  Access Model/Code and Paper
Bayesian Optimization in AlphaGo

Dec 17, 2018
Yutian Chen, Aja Huang, Ziyu Wang, Ioannis Antonoglou, Julian Schrittwieser, David Silver, Nando de Freitas


  Access Model/Code and Paper
Learning to Search with MCTSnets

Jul 17, 2018
Arthur Guez, Théophane Weber, Ioannis Antonoglou, Karen Simonyan, Oriol Vinyals, Daan Wierstra, Rémi Munos, David Silver

* ICML 2018 (camera-ready version) 

  Access Model/Code and Paper
Human-level performance in first-person multiplayer games with population-based deep reinforcement learning

Jul 03, 2018
Max Jaderberg, Wojciech M. Czarnecki, Iain Dunning, Luke Marris, Guy Lever, Antonio Garcia Castaneda, Charles Beattie, Neil C. Rabinowitz, Ari S. Morcos, Avraham Ruderman, Nicolas Sonnerat, Tim Green, Louise Deason, Joel Z. Leibo, David Silver, Demis Hassabis, Koray Kavukcuoglu, Thore Graepel


  Access Model/Code and Paper
Unicorn: Continual Learning with a Universal, Off-policy Agent

Jul 03, 2018
Daniel J. Mankowitz, Augustin Žídek, André Barreto, Dan Horgan, Matteo Hessel, John Quan, Junhyuk Oh, Hado van Hasselt, David Silver, Tom Schaul


  Access Model/Code and Paper
Implicit Quantile Networks for Distributional Reinforcement Learning

Jun 14, 2018
Will Dabney, Georg Ostrovski, David Silver, Rémi Munos

* ICML 2018 

  Access Model/Code and Paper
Meta-Gradient Reinforcement Learning

May 24, 2018
Zhongwen Xu, Hado van Hasselt, David Silver


  Access Model/Code and Paper
Successor Features for Transfer in Reinforcement Learning

Apr 12, 2018
André Barreto, Will Dabney, Rémi Munos, Jonathan J. Hunt, Tom Schaul, Hado van Hasselt, David Silver

* Published at NIPS 2017 

  Access Model/Code and Paper
Unsupervised Predictive Memory in a Goal-Directed Agent

Mar 28, 2018
Greg Wayne, Chia-Chun Hung, David Amos, Mehdi Mirza, Arun Ahuja, Agnieszka Grabska-Barwinska, Jack Rae, Piotr Mirowski, Joel Z. Leibo, Adam Santoro, Mevlana Gemici, Malcolm Reynolds, Tim Harley, Josh Abramson, Shakir Mohamed, Danilo Rezende, David Saxton, Adam Cain, Chloe Hillier, David Silver, Koray Kavukcuoglu, Matt Botvinick, Demis Hassabis, Timothy Lillicrap


  Access Model/Code and Paper
Distributed Prioritized Experience Replay

Mar 02, 2018
Dan Horgan, John Quan, David Budden, Gabriel Barth-Maron, Matteo Hessel, Hado van Hasselt, David Silver

* Accepted to International Conference on Learning Representations 2018 

  Access Model/Code and Paper
Imagination-Augmented Agents for Deep Reinforcement Learning

Feb 14, 2018
Théophane Weber, Sébastien Racanière, David P. Reichert, Lars Buesing, Arthur Guez, Danilo Jimenez Rezende, Adria Puigdomènech Badia, Oriol Vinyals, Nicolas Heess, Yujia Li, Razvan Pascanu, Peter Battaglia, Demis Hassabis, David Silver, Daan Wierstra


  Access Model/Code and Paper
Mastering Chess and Shogi by Self-Play with a General Reinforcement Learning Algorithm

Dec 05, 2017
David Silver, Thomas Hubert, Julian Schrittwieser, Ioannis Antonoglou, Matthew Lai, Arthur Guez, Marc Lanctot, Laurent Sifre, Dharshan Kumaran, Thore Graepel, Timothy Lillicrap, Karen Simonyan, Demis Hassabis


  Access Model/Code and Paper
A Unified Game-Theoretic Approach to Multiagent Reinforcement Learning

Nov 07, 2017
Marc Lanctot, Vinicius Zambaldi, Audrunas Gruslys, Angeliki Lazaridou, Karl Tuyls, Julien Perolat, David Silver, Thore Graepel

* Camera-ready copy of NIPS 2017 paper, including appendix 

  Access Model/Code and Paper
Rainbow: Combining Improvements in Deep Reinforcement Learning

Oct 06, 2017
Matteo Hessel, Joseph Modayil, Hado van Hasselt, Tom Schaul, Georg Ostrovski, Will Dabney, Dan Horgan, Bilal Piot, Mohammad Azar, David Silver

* Under review as a conference paper at AAAI 2018 

  Access Model/Code and Paper
StarCraft II: A New Challenge for Reinforcement Learning

Aug 16, 2017
Oriol Vinyals, Timo Ewalds, Sergey Bartunov, Petko Georgiev, Alexander Sasha Vezhnevets, Michelle Yeo, Alireza Makhzani, Heinrich Küttler, John Agapiou, Julian Schrittwieser, John Quan, Stephen Gaffney, Stig Petersen, Karen Simonyan, Tom Schaul, Hado van Hasselt, David Silver, Timothy Lillicrap, Kevin Calderone, Paul Keet, Anthony Brunasso, David Lawrence, Anders Ekermo, Jacob Repp, Rodney Tsing

* Collaboration between DeepMind & Blizzard. 20 pages, 9 figures, 2 tables 

  Access Model/Code and Paper
The Predictron: End-To-End Learning and Planning

Jul 20, 2017
David Silver, Hado van Hasselt, Matteo Hessel, Tom Schaul, Arthur Guez, Tim Harley, Gabriel Dulac-Arnold, David Reichert, Neil Rabinowitz, Andre Barreto, Thomas Degris

* Camera-ready version, ICML 2017, with supplement 

  Access Model/Code and Paper
Emergence of Locomotion Behaviours in Rich Environments

Jul 10, 2017
Nicolas Heess, Dhruva TB, Srinivasan Sriram, Jay Lemmon, Josh Merel, Greg Wayne, Yuval Tassa, Tom Erez, Ziyu Wang, S. M. Ali Eslami, Martin Riedmiller, David Silver


  Access Model/Code and Paper
Decoupled Neural Interfaces using Synthetic Gradients

Jul 03, 2017
Max Jaderberg, Wojciech Marian Czarnecki, Simon Osindero, Oriol Vinyals, Alex Graves, David Silver, Koray Kavukcuoglu


  Access Model/Code and Paper