Picture for Hado van Hasselt

Hado van Hasselt

Emphatic Algorithms for Deep Reinforcement Learning

Add code
Jun 21, 2021
Figure 1 for Emphatic Algorithms for Deep Reinforcement Learning
Figure 2 for Emphatic Algorithms for Deep Reinforcement Learning
Figure 3 for Emphatic Algorithms for Deep Reinforcement Learning
Figure 4 for Emphatic Algorithms for Deep Reinforcement Learning
Viaarxiv icon

Podracer architectures for scalable Reinforcement Learning

Add code
Apr 13, 2021
Figure 1 for Podracer architectures for scalable Reinforcement Learning
Figure 2 for Podracer architectures for scalable Reinforcement Learning
Figure 3 for Podracer architectures for scalable Reinforcement Learning
Figure 4 for Podracer architectures for scalable Reinforcement Learning
Viaarxiv icon

Muesli: Combining Improvements in Policy Optimization

Add code
Apr 13, 2021
Figure 1 for Muesli: Combining Improvements in Policy Optimization
Figure 2 for Muesli: Combining Improvements in Policy Optimization
Figure 3 for Muesli: Combining Improvements in Policy Optimization
Figure 4 for Muesli: Combining Improvements in Policy Optimization
Viaarxiv icon

Synthetic Returns for Long-Term Credit Assignment

Add code
Feb 24, 2021
Figure 1 for Synthetic Returns for Long-Term Credit Assignment
Figure 2 for Synthetic Returns for Long-Term Credit Assignment
Figure 3 for Synthetic Returns for Long-Term Credit Assignment
Figure 4 for Synthetic Returns for Long-Term Credit Assignment
Viaarxiv icon

Discovery of Options via Meta-Learned Subgoals

Add code
Feb 12, 2021
Figure 1 for Discovery of Options via Meta-Learned Subgoals
Figure 2 for Discovery of Options via Meta-Learned Subgoals
Figure 3 for Discovery of Options via Meta-Learned Subgoals
Figure 4 for Discovery of Options via Meta-Learned Subgoals
Viaarxiv icon

Forethought and Hindsight in Credit Assignment

Add code
Oct 26, 2020
Figure 1 for Forethought and Hindsight in Credit Assignment
Figure 2 for Forethought and Hindsight in Credit Assignment
Figure 3 for Forethought and Hindsight in Credit Assignment
Figure 4 for Forethought and Hindsight in Credit Assignment
Viaarxiv icon

Discovering Reinforcement Learning Algorithms

Add code
Jul 17, 2020
Figure 1 for Discovering Reinforcement Learning Algorithms
Figure 2 for Discovering Reinforcement Learning Algorithms
Figure 3 for Discovering Reinforcement Learning Algorithms
Figure 4 for Discovering Reinforcement Learning Algorithms
Viaarxiv icon

Meta-Gradient Reinforcement Learning with an Objective Discovered Online

Add code
Jul 16, 2020
Figure 1 for Meta-Gradient Reinforcement Learning with an Objective Discovered Online
Figure 2 for Meta-Gradient Reinforcement Learning with an Objective Discovered Online
Figure 3 for Meta-Gradient Reinforcement Learning with an Objective Discovered Online
Figure 4 for Meta-Gradient Reinforcement Learning with an Objective Discovered Online
Viaarxiv icon

Expected Eligibility Traces

Add code
Jul 03, 2020
Figure 1 for Expected Eligibility Traces
Figure 2 for Expected Eligibility Traces
Figure 3 for Expected Eligibility Traces
Figure 4 for Expected Eligibility Traces
Viaarxiv icon

Self-Tuning Deep Reinforcement Learning

Add code
Mar 02, 2020
Figure 1 for Self-Tuning Deep Reinforcement Learning
Figure 2 for Self-Tuning Deep Reinforcement Learning
Figure 3 for Self-Tuning Deep Reinforcement Learning
Figure 4 for Self-Tuning Deep Reinforcement Learning
Viaarxiv icon