Alert button
Picture for Emma Brunskill

Emma Brunskill

Alert button

Sublinear Optimal Policy Value Estimation in Contextual Bandits

Add code
Bookmark button
Alert button
Dec 12, 2019
Weihao Kong, Gregory Valiant, Emma Brunskill

Figure 1 for Sublinear Optimal Policy Value Estimation in Contextual Bandits
Figure 2 for Sublinear Optimal Policy Value Estimation in Contextual Bandits
Viaarxiv icon

Missingness as Stability: Understanding the Structure of Missingness in Longitudinal EHR data and its Impact on Reinforcement Learning in Healthcare

Add code
Bookmark button
Alert button
Nov 16, 2019
Scott L. Fleming, Kuhan Jeyapragasan, Tony Duan, Daisy Ding, Saurabh Gombar, Nigam Shah, Emma Brunskill

Figure 1 for Missingness as Stability: Understanding the Structure of Missingness in Longitudinal EHR data and its Impact on Reinforcement Learning in Healthcare
Figure 2 for Missingness as Stability: Understanding the Structure of Missingness in Longitudinal EHR data and its Impact on Reinforcement Learning in Healthcare
Figure 3 for Missingness as Stability: Understanding the Structure of Missingness in Longitudinal EHR data and its Impact on Reinforcement Learning in Healthcare
Viaarxiv icon

Being Optimistic to Be Conservative: Quickly Learning a CVaR Policy

Add code
Bookmark button
Alert button
Nov 05, 2019
Ramtin Keramati, Christoph Dann, Alex Tamkin, Emma Brunskill

Figure 1 for Being Optimistic to Be Conservative: Quickly Learning a CVaR Policy
Figure 2 for Being Optimistic to Be Conservative: Quickly Learning a CVaR Policy
Figure 3 for Being Optimistic to Be Conservative: Quickly Learning a CVaR Policy
Figure 4 for Being Optimistic to Be Conservative: Quickly Learning a CVaR Policy
Viaarxiv icon

Problem Dependent Reinforcement Learning Bounds Which Can Identify Bandit Structure in MDPs

Add code
Bookmark button
Alert button
Nov 03, 2019
Andrea Zanette, Emma Brunskill

Viaarxiv icon

Understanding the Curse of Horizon in Off-Policy Evaluation via Conditional Importance Sampling

Add code
Bookmark button
Alert button
Oct 15, 2019
Yao Liu, Pierre-Luc Bacon, Emma Brunskill

Figure 1 for Understanding the Curse of Horizon in Off-Policy Evaluation via Conditional Importance Sampling
Figure 2 for Understanding the Curse of Horizon in Off-Policy Evaluation via Conditional Importance Sampling
Figure 3 for Understanding the Curse of Horizon in Off-Policy Evaluation via Conditional Importance Sampling
Viaarxiv icon

Directed Exploration for Reinforcement Learning

Add code
Bookmark button
Alert button
Jun 18, 2019
Zhaohan Daniel Guo, Emma Brunskill

Figure 1 for Directed Exploration for Reinforcement Learning
Figure 2 for Directed Exploration for Reinforcement Learning
Figure 3 for Directed Exploration for Reinforcement Learning
Figure 4 for Directed Exploration for Reinforcement Learning
Viaarxiv icon

Learning When-to-Treat Policies

Add code
Bookmark button
Alert button
May 23, 2019
Xinkun Nie, Emma Brunskill, Stefan Wager

Figure 1 for Learning When-to-Treat Policies
Figure 2 for Learning When-to-Treat Policies
Figure 3 for Learning When-to-Treat Policies
Figure 4 for Learning When-to-Treat Policies
Viaarxiv icon

Combining Parametric and Nonparametric Models for Off-Policy Evaluation

Add code
Bookmark button
Alert button
May 16, 2019
Omer Gottesman, Yao Liu, Scott Sussex, Emma Brunskill, Finale Doshi-Velez

Figure 1 for Combining Parametric and Nonparametric Models for Off-Policy Evaluation
Figure 2 for Combining Parametric and Nonparametric Models for Off-Policy Evaluation
Figure 3 for Combining Parametric and Nonparametric Models for Off-Policy Evaluation
Figure 4 for Combining Parametric and Nonparametric Models for Off-Policy Evaluation
Viaarxiv icon

PLOTS: Procedure Learning from Observations using Subtask Structure

Add code
Bookmark button
Alert button
Apr 17, 2019
Tong Mu, Karan Goel, Emma Brunskill

Figure 1 for PLOTS: Procedure Learning from Observations using Subtask Structure
Figure 2 for PLOTS: Procedure Learning from Observations using Subtask Structure
Figure 3 for PLOTS: Procedure Learning from Observations using Subtask Structure
Figure 4 for PLOTS: Procedure Learning from Observations using Subtask Structure
Viaarxiv icon

Off-Policy Policy Gradient with State Distribution Correction

Add code
Bookmark button
Alert button
Apr 17, 2019
Yao Liu, Adith Swaminathan, Alekh Agarwal, Emma Brunskill

Figure 1 for Off-Policy Policy Gradient with State Distribution Correction
Figure 2 for Off-Policy Policy Gradient with State Distribution Correction
Figure 3 for Off-Policy Policy Gradient with State Distribution Correction
Figure 4 for Off-Policy Policy Gradient with State Distribution Correction
Viaarxiv icon