Hado van Hasselt

Chaining Value Functions for Off-Policy Learning

Jan 17, 2022
Simon Schmitt, John Shawe-Taylor, Hado van Hasselt

Via arXiv

Self-Consistent Models and Values

Oct 25, 2021
Gregory Farquhar, Kate Baumli, Zita Marinho, Angelos Filos, Matteo Hessel, Hado van Hasselt, David Silver

Via arXiv

Pick Your Battles: Interaction Graphs as Population-Level Objectives for Strategic Diversity

Oct 08, 2021
Marta Garnelo, Wojciech Marian Czarnecki, Siqi Liu, Dhruva Tirumala, Junhyuk Oh, Gauthier Gidel, Hado van Hasselt, David Balduzzi

Via arXiv

Introducing Symmetries to Black Box Meta Reinforcement Learning

Sep 22, 2021
Louis Kirsch, Sebastian Flennerhag, Hado van Hasselt, Abram Friesen, Junhyuk Oh, Yutian Chen

Via arXiv

Bootstrapped Meta-Learning

Sep 09, 2021
Sebastian Flennerhag, Yannick Schroecker, Tom Zahavy, Hado van Hasselt, David Silver, Satinder Singh

Via arXiv

Learning Expected Emphatic Traces for Deep RL

Jul 12, 2021
Ray Jiang, Shangtong Zhang, Veronica Chelu, Adam White, Hado van Hasselt

Via arXiv

Emphatic Algorithms for Deep Reinforcement Learning

Jun 21, 2021
Ray Jiang, Tom Zahavy, Zhongwen Xu, Adam White, Matteo Hessel, Charles Blundell, Hado van Hasselt

Via arXiv

Podracer architectures for scalable Reinforcement Learning

Apr 13, 2021
Matteo Hessel, Manuel Kroiss, Aidan Clark, Iurii Kemaev, John Quan, Thomas Keck, Fabio Viola, Hado van Hasselt

Via arXiv

Muesli: Combining Improvements in Policy Optimization

Apr 13, 2021
Matteo Hessel, Ivo Danihelka, Fabio Viola, Arthur Guez, Simon Schmitt, Laurent Sifre, Theophane Weber, David Silver, Hado van Hasselt

Via arXiv

Synthetic Returns for Long-Term Credit Assignment

Feb 24, 2021
David Raposo, Sam Ritter, Adam Santoro, Greg Wayne, Theophane Weber, Matt Botvinick, Hado van Hasselt, Francis Song

Via arXiv