Alert button
Picture for Tom Schaul

Tom Schaul

Alert button

Adapting Behaviour for Learning Progress

Add code
Bookmark button
Alert button
Dec 14, 2019
Tom Schaul, Diana Borsa, David Ding, David Szepesvari, Georg Ostrovski, Will Dabney, Simon Osindero

Figure 1 for Adapting Behaviour for Learning Progress
Figure 2 for Adapting Behaviour for Learning Progress
Figure 3 for Adapting Behaviour for Learning Progress
Figure 4 for Adapting Behaviour for Learning Progress
Viaarxiv icon

Conditional Importance Sampling for Off-Policy Learning

Add code
Bookmark button
Alert button
Oct 16, 2019
Mark Rowland, Anna Harutyunyan, Hado van Hasselt, Diana Borsa, Tom Schaul, Rémi Munos, Will Dabney

Figure 1 for Conditional Importance Sampling for Off-Policy Learning
Figure 2 for Conditional Importance Sampling for Off-Policy Learning
Figure 3 for Conditional Importance Sampling for Off-Policy Learning
Figure 4 for Conditional Importance Sampling for Off-Policy Learning
Viaarxiv icon

Non-Differentiable Supervised Learning with Evolution Strategies and Hybrid Methods

Add code
Bookmark button
Alert button
Jun 07, 2019
Karel Lenc, Erich Elsen, Tom Schaul, Karen Simonyan

Figure 1 for Non-Differentiable Supervised Learning with Evolution Strategies and Hybrid Methods
Figure 2 for Non-Differentiable Supervised Learning with Evolution Strategies and Hybrid Methods
Figure 3 for Non-Differentiable Supervised Learning with Evolution Strategies and Hybrid Methods
Figure 4 for Non-Differentiable Supervised Learning with Evolution Strategies and Hybrid Methods
Viaarxiv icon

Ray Interference: a Source of Plateaus in Deep Reinforcement Learning

Add code
Bookmark button
Alert button
Apr 25, 2019
Tom Schaul, Diana Borsa, Joseph Modayil, Razvan Pascanu

Figure 1 for Ray Interference: a Source of Plateaus in Deep Reinforcement Learning
Figure 2 for Ray Interference: a Source of Plateaus in Deep Reinforcement Learning
Figure 3 for Ray Interference: a Source of Plateaus in Deep Reinforcement Learning
Figure 4 for Ray Interference: a Source of Plateaus in Deep Reinforcement Learning
Viaarxiv icon

Transfer in Deep Reinforcement Learning Using Successor Features and Generalised Policy Improvement

Add code
Bookmark button
Alert button
Jan 30, 2019
André Barreto, Diana Borsa, John Quan, Tom Schaul, David Silver, Matteo Hessel, Daniel Mankowitz, Augustin Žídek, Rémi Munos

Figure 1 for Transfer in Deep Reinforcement Learning Using Successor Features and Generalised Policy Improvement
Figure 2 for Transfer in Deep Reinforcement Learning Using Successor Features and Generalised Policy Improvement
Figure 3 for Transfer in Deep Reinforcement Learning Using Successor Features and Generalised Policy Improvement
Figure 4 for Transfer in Deep Reinforcement Learning Using Successor Features and Generalised Policy Improvement
Viaarxiv icon

Universal Successor Features Approximators

Add code
Bookmark button
Alert button
Dec 18, 2018
Diana Borsa, André Barreto, John Quan, Daniel Mankowitz, Rémi Munos, Hado van Hasselt, David Silver, Tom Schaul

Figure 1 for Universal Successor Features Approximators
Figure 2 for Universal Successor Features Approximators
Figure 3 for Universal Successor Features Approximators
Figure 4 for Universal Successor Features Approximators
Viaarxiv icon

The Barbados 2018 List of Open Issues in Continual Learning

Add code
Bookmark button
Alert button
Nov 16, 2018
Tom Schaul, Hado van Hasselt, Joseph Modayil, Martha White, Adam White, Pierre-Luc Bacon, Jean Harb, Shibl Mourad, Marc Bellemare, Doina Precup

Viaarxiv icon

Unicorn: Continual Learning with a Universal, Off-policy Agent

Add code
Bookmark button
Alert button
Jul 03, 2018
Daniel J. Mankowitz, Augustin Žídek, André Barreto, Dan Horgan, Matteo Hessel, John Quan, Junhyuk Oh, Hado van Hasselt, David Silver, Tom Schaul

Figure 1 for Unicorn: Continual Learning with a Universal, Off-policy Agent
Figure 2 for Unicorn: Continual Learning with a Universal, Off-policy Agent
Figure 3 for Unicorn: Continual Learning with a Universal, Off-policy Agent
Figure 4 for Unicorn: Continual Learning with a Universal, Off-policy Agent
Viaarxiv icon

Meta-Learning by the Baldwin Effect

Add code
Bookmark button
Alert button
Jun 22, 2018
Chrisantha Thomas Fernando, Jakub Sygnowski, Simon Osindero, Jane Wang, Tom Schaul, Denis Teplyashin, Pablo Sprechmann, Alexander Pritzel, Andrei A. Rusu

Figure 1 for Meta-Learning by the Baldwin Effect
Figure 2 for Meta-Learning by the Baldwin Effect
Figure 3 for Meta-Learning by the Baldwin Effect
Figure 4 for Meta-Learning by the Baldwin Effect
Viaarxiv icon

Successor Features for Transfer in Reinforcement Learning

Add code
Bookmark button
Alert button
Apr 12, 2018
André Barreto, Will Dabney, Rémi Munos, Jonathan J. Hunt, Tom Schaul, Hado van Hasselt, David Silver

Figure 1 for Successor Features for Transfer in Reinforcement Learning
Figure 2 for Successor Features for Transfer in Reinforcement Learning
Figure 3 for Successor Features for Transfer in Reinforcement Learning
Figure 4 for Successor Features for Transfer in Reinforcement Learning
Viaarxiv icon