Picture for Kelly W. Zhang

Kelly W. Zhang

Contextual Thompson Sampling via Generation of Missing Data

Add code
Feb 10, 2025
Viaarxiv icon

Impatient Bandits: Optimizing for the Long-Term Without Delay

Add code
Jan 14, 2025
Figure 1 for Impatient Bandits: Optimizing for the Long-Term Without Delay
Figure 2 for Impatient Bandits: Optimizing for the Long-Term Without Delay
Figure 3 for Impatient Bandits: Optimizing for the Long-Term Without Delay
Figure 4 for Impatient Bandits: Optimizing for the Long-Term Without Delay
Viaarxiv icon

A Deployed Online Reinforcement Learning Algorithm In An Oral Health Clinical Trial

Add code
Sep 03, 2024
Figure 1 for A Deployed Online Reinforcement Learning Algorithm In An Oral Health Clinical Trial
Figure 2 for A Deployed Online Reinforcement Learning Algorithm In An Oral Health Clinical Trial
Figure 3 for A Deployed Online Reinforcement Learning Algorithm In An Oral Health Clinical Trial
Figure 4 for A Deployed Online Reinforcement Learning Algorithm In An Oral Health Clinical Trial
Viaarxiv icon

Oralytics Reinforcement Learning Algorithm

Add code
Jun 19, 2024
Figure 1 for Oralytics Reinforcement Learning Algorithm
Figure 2 for Oralytics Reinforcement Learning Algorithm
Figure 3 for Oralytics Reinforcement Learning Algorithm
Figure 4 for Oralytics Reinforcement Learning Algorithm
Viaarxiv icon

The Fallacy of Minimizing Local Regret in the Sequential Task Setting

Add code
Mar 16, 2024
Figure 1 for The Fallacy of Minimizing Local Regret in the Sequential Task Setting
Viaarxiv icon

Monitoring Fidelity of Online Reinforcement Learning Algorithms in Clinical Trials

Add code
Feb 26, 2024
Figure 1 for Monitoring Fidelity of Online Reinforcement Learning Algorithms in Clinical Trials
Viaarxiv icon

Reward Design For An Online Reinforcement Learning Algorithm Supporting Oral Self-Care

Add code
Aug 15, 2022
Figure 1 for Reward Design For An Online Reinforcement Learning Algorithm Supporting Oral Self-Care
Figure 2 for Reward Design For An Online Reinforcement Learning Algorithm Supporting Oral Self-Care
Figure 3 for Reward Design For An Online Reinforcement Learning Algorithm Supporting Oral Self-Care
Viaarxiv icon

A Bayesian Approach to Learning Bandit Structure in Markov Decision Processes

Add code
Jul 30, 2022
Figure 1 for A Bayesian Approach to Learning Bandit Structure in Markov Decision Processes
Figure 2 for A Bayesian Approach to Learning Bandit Structure in Markov Decision Processes
Figure 3 for A Bayesian Approach to Learning Bandit Structure in Markov Decision Processes
Figure 4 for A Bayesian Approach to Learning Bandit Structure in Markov Decision Processes
Viaarxiv icon

Designing Reinforcement Learning Algorithms for Digital Interventions: Pre-implementation Guidelines

Add code
Jun 08, 2022
Figure 1 for Designing Reinforcement Learning Algorithms for Digital Interventions: Pre-implementation Guidelines
Figure 2 for Designing Reinforcement Learning Algorithms for Digital Interventions: Pre-implementation Guidelines
Figure 3 for Designing Reinforcement Learning Algorithms for Digital Interventions: Pre-implementation Guidelines
Figure 4 for Designing Reinforcement Learning Algorithms for Digital Interventions: Pre-implementation Guidelines
Viaarxiv icon

Statistical Inference with M-Estimators on Adaptively Collected Data

Add code
May 28, 2021
Figure 1 for Statistical Inference with M-Estimators on Adaptively Collected Data
Figure 2 for Statistical Inference with M-Estimators on Adaptively Collected Data
Figure 3 for Statistical Inference with M-Estimators on Adaptively Collected Data
Figure 4 for Statistical Inference with M-Estimators on Adaptively Collected Data
Viaarxiv icon