Emma Brunskill

Stanford University

Adaptive Interventions with User-Defined Goals for Health Behavior Change

Nov 16, 2023

Proportional Response: Contextual Bandits for Simple and Cumulative Regret Minimization

Jul 05, 2023

Supervised Pretraining Can Learn In-Context Reinforcement Learning

Jun 26, 2023

Waypoint Transformer: Reinforcement Learning via Supervised Learning with Intermediate Targets

Jun 24, 2023

Reinforcement Learning Tutor Better Supported Lower Performers in a Math Task

Apr 13, 2023

Estimating Optimal Policy Value in General Linear Contextual Bandits

Feb 19, 2023

Model-based Offline Reinforcement Learning with Local Misspecification

Jan 26, 2023

Giving Feedback on Interactive Student Programs with Meta-Exploration

Nov 16, 2022

Oracle Inequalities for Model Selection in Offline Reinforcement Learning

Nov 03, 2022

Data-Efficient Pipeline for Offline Reinforcement Learning with Limited Data

Oct 16, 2022