Hado van Hasselt

General non-linear Bellman equations

Jul 08, 2019
Hado van Hasselt, John Quan, Matteo Hessel, Zhongwen Xu, Diana Borsa, André Barreto

On Inductive Biases in Deep Reinforcement Learning

Jul 05, 2019
Matteo Hessel, Hado van Hasselt, Joseph Modayil, David Silver

When to use parametric models in reinforcement learning?

Jun 12, 2019
Hado van Hasselt, Matteo Hessel, John Aslanides

Meta-learning of Sequential Strategies

May 08, 2019
Pedro A. Ortega, Jane X. Wang, Mark Rowland, Tim Genewein, Zeb Kurth-Nelson, Razvan Pascanu, Nicolas Heess, Joel Veness, Alex Pritzel, Pablo Sprechmann, Siddhant M. Jayakumar, Tom McGrath, Kevin Miller, Mohammad Azar, Ian Osband, Neil Rabinowitz, András György, Silvia Chiappa, Simon Osindero, Yee Whye Teh, Hado van Hasselt, Nando de Freitas, Matthew Botvinick, Shane Legg

Universal Successor Features Approximators

Dec 18, 2018
Diana Borsa, André Barreto, John Quan, Daniel Mankowitz, Rémi Munos, Hado van Hasselt, David Silver, Tom Schaul

Deep Reinforcement Learning and the Deadly Triad

Dec 06, 2018
Hado van Hasselt, Yotam Doron, Florian Strub, Matteo Hessel, Nicolas Sonnerat, Joseph Modayil

The Barbados 2018 List of Open Issues in Continual Learning

Nov 16, 2018
Tom Schaul, Hado van Hasselt, Joseph Modayil, Martha White, Adam White, Pierre-Luc Bacon, Jean Harb, Shibl Mourad, Marc Bellemare, Doina Precup

Multi-task Deep Reinforcement Learning with PopArt

Sep 12, 2018
Matteo Hessel, Hubert Soyer, Lasse Espeholt, Wojciech Czarnecki, Simon Schmitt, Hado van Hasselt

Unicorn: Continual Learning with a Universal, Off-policy Agent

Jul 03, 2018
Daniel J. Mankowitz, Augustin Žídek, André Barreto, Dan Horgan, Matteo Hessel, John Quan, Junhyuk Oh, Hado van Hasselt, David Silver, Tom Schaul

Observe and Look Further: Achieving Consistent Performance on Atari

May 29, 2018
Tobias Pohlen, Bilal Piot, Todd Hester, Mohammad Gheshlaghi Azar, Dan Horgan, David Budden, Gabriel Barth-Maron, Hado van Hasselt, John Quan, Mel Večerík, Matteo Hessel, Rémi Munos, Olivier Pietquin
