Martha White

Demystifying the Recency Heuristic in Temporal-Difference Learning

Jun 18, 2024

A New View on Planning in Online Reinforcement Learning

Jun 03, 2024

Tuning for the Unknown: Revisiting Evaluation Strategies for Lifelong RL

Apr 02, 2024

Investigating the Histogram Loss in Regression

Feb 20, 2024

What to Do When Your Discrete Optimization Is the Size of a Neural Network?

Feb 15, 2024

Compound Returns Reduce Variance in Reinforcement Learning

Feb 06, 2024

When is Offline Policy Selection Sample Efficient for Reinforcement Learning?

Dec 04, 2023

GVFs in the Real World: Making Predictions Online for Water Treatment

Dec 04, 2023

Measuring and Mitigating Interference in Reinforcement Learning

Jul 10, 2023

Coagent Networks: Generalized and Scaled

May 16, 2023