Alert button
Picture for Martha White

Martha White

Alert button

Asymptotically Unbiased Off-Policy Policy Evaluation when Reusing Old Data in Nonstationary Environments

Add code
Bookmark button
Alert button
Feb 23, 2023
Vincent Liu, Yash Chandak, Philip Thomas, Martha White

Figure 1 for Asymptotically Unbiased Off-Policy Policy Evaluation when Reusing Old Data in Nonstationary Environments
Figure 2 for Asymptotically Unbiased Off-Policy Policy Evaluation when Reusing Old Data in Nonstationary Environments
Figure 3 for Asymptotically Unbiased Off-Policy Policy Evaluation when Reusing Old Data in Nonstationary Environments
Figure 4 for Asymptotically Unbiased Off-Policy Policy Evaluation when Reusing Old Data in Nonstationary Environments
Viaarxiv icon

Generalized Munchausen Reinforcement Learning using Tsallis KL Divergence

Add code
Bookmark button
Alert button
Jan 27, 2023
Lingwei Zhu, Zheng Chen, Takamitsu Matsubara, Martha White

Figure 1 for Generalized Munchausen Reinforcement Learning using Tsallis KL Divergence
Figure 2 for Generalized Munchausen Reinforcement Learning using Tsallis KL Divergence
Figure 3 for Generalized Munchausen Reinforcement Learning using Tsallis KL Divergence
Figure 4 for Generalized Munchausen Reinforcement Learning using Tsallis KL Divergence
Viaarxiv icon

Trajectory-Aware Eligibility Traces for Off-Policy Reinforcement Learning

Add code
Bookmark button
Alert button
Jan 26, 2023
Brett Daley, Martha White, Christopher Amato, Marlos C. Machado

Figure 1 for Trajectory-Aware Eligibility Traces for Off-Policy Reinforcement Learning
Figure 2 for Trajectory-Aware Eligibility Traces for Off-Policy Reinforcement Learning
Figure 3 for Trajectory-Aware Eligibility Traces for Off-Policy Reinforcement Learning
Figure 4 for Trajectory-Aware Eligibility Traces for Off-Policy Reinforcement Learning
Viaarxiv icon

Goal-Space Planning with Subgoal Models

Add code
Bookmark button
Alert button
Jun 08, 2022
Chunlok Lo, Gabor Mihucz, Adam White, Farzane Aminmansour, Martha White

Viaarxiv icon

No More Pesky Hyperparameters: Offline Hyperparameter Tuning for RL

Add code
Bookmark button
Alert button
May 18, 2022
Han Wang, Archit Sakhadeo, Adam White, James Bell, Vincent Liu, Xutong Zhao, Puer Liu, Tadashi Kozuno, Alona Fyshe, Martha White

Figure 1 for No More Pesky Hyperparameters: Offline Hyperparameter Tuning for RL
Figure 2 for No More Pesky Hyperparameters: Offline Hyperparameter Tuning for RL
Figure 3 for No More Pesky Hyperparameters: Offline Hyperparameter Tuning for RL
Figure 4 for No More Pesky Hyperparameters: Offline Hyperparameter Tuning for RL
Viaarxiv icon

Robust Losses for Learning Value Functions

Add code
Bookmark button
Alert button
May 17, 2022
Andrew Patterson, Victor Liao, Martha White

Figure 1 for Robust Losses for Learning Value Functions
Figure 2 for Robust Losses for Learning Value Functions
Figure 3 for Robust Losses for Learning Value Functions
Figure 4 for Robust Losses for Learning Value Functions
Viaarxiv icon

Investigating the Properties of Neural Network Representations in Reinforcement Learning

Add code
Bookmark button
Alert button
Mar 30, 2022
Han Wang, Erfan Miahi, Martha White, Marlos C. Machado, Zaheer Abbas, Raksha Kumaraswamy, Vincent Liu, Adam White

Figure 1 for Investigating the Properties of Neural Network Representations in Reinforcement Learning
Figure 2 for Investigating the Properties of Neural Network Representations in Reinforcement Learning
Figure 3 for Investigating the Properties of Neural Network Representations in Reinforcement Learning
Figure 4 for Investigating the Properties of Neural Network Representations in Reinforcement Learning
Viaarxiv icon

Resonance in Weight Space: Covariate Shift Can Drive Divergence of SGD with Momentum

Add code
Bookmark button
Alert button
Mar 22, 2022
Kirby Banman, Liam Peet-Pare, Nidhi Hegde, Alona Fyshe, Martha White

Figure 1 for Resonance in Weight Space: Covariate Shift Can Drive Divergence of SGD with Momentum
Figure 2 for Resonance in Weight Space: Covariate Shift Can Drive Divergence of SGD with Momentum
Figure 3 for Resonance in Weight Space: Covariate Shift Can Drive Divergence of SGD with Momentum
Figure 4 for Resonance in Weight Space: Covariate Shift Can Drive Divergence of SGD with Momentum
Viaarxiv icon

Continual Auxiliary Task Learning

Add code
Bookmark button
Alert button
Feb 22, 2022
Matthew McLeod, Chunlok Lo, Matthew Schlegel, Andrew Jacobsen, Raksha Kumaraswamy, Martha White, Adam White

Figure 1 for Continual Auxiliary Task Learning
Figure 2 for Continual Auxiliary Task Learning
Figure 3 for Continual Auxiliary Task Learning
Figure 4 for Continual Auxiliary Task Learning
Viaarxiv icon

A Temporal-Difference Approach to Policy Gradient Estimation

Add code
Bookmark button
Alert button
Feb 04, 2022
Samuele Tosatto, Andrew Patterson, Martha White, A. Rupam Mahmood

Figure 1 for A Temporal-Difference Approach to Policy Gradient Estimation
Figure 2 for A Temporal-Difference Approach to Policy Gradient Estimation
Figure 3 for A Temporal-Difference Approach to Policy Gradient Estimation
Figure 4 for A Temporal-Difference Approach to Policy Gradient Estimation
Viaarxiv icon