Tom Schaul

Deep Q-learning from Demonstrations

Nov 22, 2017
Todd Hester, Matej Vecerik, Olivier Pietquin, Marc Lanctot, Tom Schaul, Bilal Piot, Dan Horgan, John Quan, Andrew Sendonaris, Gabriel Dulac-Arnold, Ian Osband, John Agapiou, Joel Z. Leibo, Audrunas Gruslys

Rainbow: Combining Improvements in Deep Reinforcement Learning

Oct 06, 2017
Matteo Hessel, Joseph Modayil, Hado van Hasselt, Tom Schaul, Georg Ostrovski, Will Dabney, Dan Horgan, Bilal Piot, Mohammad Azar, David Silver

StarCraft II: A New Challenge for Reinforcement Learning

Aug 16, 2017
Oriol Vinyals, Timo Ewalds, Sergey Bartunov, Petko Georgiev, Alexander Sasha Vezhnevets, Michelle Yeo, Alireza Makhzani, Heinrich Küttler, John Agapiou, Julian Schrittwieser, John Quan, Stephen Gaffney, Stig Petersen, Karen Simonyan, Tom Schaul, Hado van Hasselt, David Silver, Timothy Lillicrap, Kevin Calderone, Paul Keet, Anthony Brunasso, David Lawrence, Anders Ekermo, Jacob Repp, Rodney Tsing

The Predictron: End-To-End Learning and Planning

Jul 20, 2017
David Silver, Hado van Hasselt, Matteo Hessel, Tom Schaul, Arthur Guez, Tim Harley, Gabriel Dulac-Arnold, David Reichert, Neil Rabinowitz, Andre Barreto, Thomas Degris

FeUdal Networks for Hierarchical Reinforcement Learning

Mar 06, 2017
Alexander Sasha Vezhnevets, Simon Osindero, Tom Schaul, Nicolas Heess, Max Jaderberg, David Silver, Koray Kavukcuoglu

Learning to learn by gradient descent by gradient descent

Nov 30, 2016
Marcin Andrychowicz, Misha Denil, Sergio Gomez, Matthew W. Hoffman, David Pfau, Tom Schaul, Brendan Shillingford, Nando de Freitas

Reinforcement Learning with Unsupervised Auxiliary Tasks

Nov 16, 2016
Max Jaderberg, Volodymyr Mnih, Wojciech Marian Czarnecki, Tom Schaul, Joel Z Leibo, David Silver, Koray Kavukcuoglu

Unifying Count-Based Exploration and Intrinsic Motivation

Nov 07, 2016
Marc G. Bellemare, Sriram Srinivasan, Georg Ostrovski, Tom Schaul, David Saxton, Remi Munos

Dueling Network Architectures for Deep Reinforcement Learning

Apr 05, 2016
Ziyu Wang, Tom Schaul, Matteo Hessel, Hado van Hasselt, Marc Lanctot, Nando de Freitas

Prioritized Experience Replay

Feb 25, 2016
Tom Schaul, John Quan, Ioannis Antonoglou, David Silver
