Picture for Kenny Young

Kenny Young

Sequence Compression Speeds Up Credit Assignment in Reinforcement Learning

Add code
May 06, 2024
Figure 1 for Sequence Compression Speeds Up Credit Assignment in Reinforcement Learning
Figure 2 for Sequence Compression Speeds Up Credit Assignment in Reinforcement Learning
Figure 3 for Sequence Compression Speeds Up Credit Assignment in Reinforcement Learning
Figure 4 for Sequence Compression Speeds Up Credit Assignment in Reinforcement Learning
Viaarxiv icon

Iterative Option Discovery for Planning, by Planning

Add code
Oct 02, 2023
Viaarxiv icon

The Benefits of Model-Based Generalization in Reinforcement Learning

Add code
Nov 04, 2022
Figure 1 for The Benefits of Model-Based Generalization in Reinforcement Learning
Figure 2 for The Benefits of Model-Based Generalization in Reinforcement Learning
Figure 3 for The Benefits of Model-Based Generalization in Reinforcement Learning
Figure 4 for The Benefits of Model-Based Generalization in Reinforcement Learning
Viaarxiv icon

Doubly-Asynchronous Value Iteration: Making Value Iteration Asynchronous in Actions

Add code
Jul 04, 2022
Figure 1 for Doubly-Asynchronous Value Iteration: Making Value Iteration Asynchronous in Actions
Figure 2 for Doubly-Asynchronous Value Iteration: Making Value Iteration Asynchronous in Actions
Figure 3 for Doubly-Asynchronous Value Iteration: Making Value Iteration Asynchronous in Actions
Figure 4 for Doubly-Asynchronous Value Iteration: Making Value Iteration Asynchronous in Actions
Viaarxiv icon

Hindsight Network Credit Assignment: Efficient Credit Assignment in Networks of Discrete Stochastic Units

Add code
Oct 14, 2021
Figure 1 for Hindsight Network Credit Assignment: Efficient Credit Assignment in Networks of Discrete Stochastic Units
Figure 2 for Hindsight Network Credit Assignment: Efficient Credit Assignment in Networks of Discrete Stochastic Units
Figure 3 for Hindsight Network Credit Assignment: Efficient Credit Assignment in Networks of Discrete Stochastic Units
Figure 4 for Hindsight Network Credit Assignment: Efficient Credit Assignment in Networks of Discrete Stochastic Units
Viaarxiv icon

Hindsight Network Credit Assignment

Add code
Nov 24, 2020
Figure 1 for Hindsight Network Credit Assignment
Figure 2 for Hindsight Network Credit Assignment
Figure 3 for Hindsight Network Credit Assignment
Viaarxiv icon

Understanding the Pathologies of Approximate Policy Evaluation when Combined with Greedification in Reinforcement Learning

Add code
Oct 28, 2020
Figure 1 for Understanding the Pathologies of Approximate Policy Evaluation when Combined with Greedification in Reinforcement Learning
Figure 2 for Understanding the Pathologies of Approximate Policy Evaluation when Combined with Greedification in Reinforcement Learning
Figure 3 for Understanding the Pathologies of Approximate Policy Evaluation when Combined with Greedification in Reinforcement Learning
Figure 4 for Understanding the Pathologies of Approximate Policy Evaluation when Combined with Greedification in Reinforcement Learning
Viaarxiv icon

Variance Reduced Advantage Estimation with $δ$ Hindsight Credit Assignment

Add code
Jan 09, 2020
Figure 1 for Variance Reduced Advantage Estimation with $δ$ Hindsight Credit Assignment
Viaarxiv icon

MinAtar: An Atari-inspired Testbed for More Efficient Reinforcement Learning Experiments

Add code
Mar 07, 2019
Figure 1 for MinAtar: An Atari-inspired Testbed for More Efficient Reinforcement Learning Experiments
Figure 2 for MinAtar: An Atari-inspired Testbed for More Efficient Reinforcement Learning Experiments
Figure 3 for MinAtar: An Atari-inspired Testbed for More Efficient Reinforcement Learning Experiments
Viaarxiv icon

Metatrace: Online Step-size Tuning by Meta-gradient Descent for Reinforcement Learning Control

Add code
May 10, 2018
Figure 1 for Metatrace: Online Step-size Tuning by Meta-gradient Descent for Reinforcement Learning Control
Figure 2 for Metatrace: Online Step-size Tuning by Meta-gradient Descent for Reinforcement Learning Control
Figure 3 for Metatrace: Online Step-size Tuning by Meta-gradient Descent for Reinforcement Learning Control
Figure 4 for Metatrace: Online Step-size Tuning by Meta-gradient Descent for Reinforcement Learning Control
Viaarxiv icon