Picture for Dongruo Zhou

Dongruo Zhou

Learning Contextual Bandits Through Perturbed Rewards

Add code
Jan 24, 2022
Figure 1 for Learning Contextual Bandits Through Perturbed Rewards
Figure 2 for Learning Contextual Bandits Through Perturbed Rewards
Figure 3 for Learning Contextual Bandits Through Perturbed Rewards
Figure 4 for Learning Contextual Bandits Through Perturbed Rewards
Viaarxiv icon

Faster Perturbed Stochastic Gradient Methods for Finding Local Minima

Add code
Oct 25, 2021
Figure 1 for Faster Perturbed Stochastic Gradient Methods for Finding Local Minima
Figure 2 for Faster Perturbed Stochastic Gradient Methods for Finding Local Minima
Viaarxiv icon

Linear Contextual Bandits with Adversarial Corruptions

Add code
Oct 25, 2021
Figure 1 for Linear Contextual Bandits with Adversarial Corruptions
Viaarxiv icon

Iterative Teacher-Aware Learning

Add code
Oct 17, 2021
Figure 1 for Iterative Teacher-Aware Learning
Figure 2 for Iterative Teacher-Aware Learning
Figure 3 for Iterative Teacher-Aware Learning
Figure 4 for Iterative Teacher-Aware Learning
Viaarxiv icon

Reward-Free Model-Based Reinforcement Learning with Linear Function Approximation

Add code
Oct 12, 2021
Figure 1 for Reward-Free Model-Based Reinforcement Learning with Linear Function Approximation
Figure 2 for Reward-Free Model-Based Reinforcement Learning with Linear Function Approximation
Viaarxiv icon

Pure Exploration in Kernel and Neural Bandits

Add code
Jun 22, 2021
Figure 1 for Pure Exploration in Kernel and Neural Bandits
Figure 2 for Pure Exploration in Kernel and Neural Bandits
Figure 3 for Pure Exploration in Kernel and Neural Bandits
Viaarxiv icon

Variance-Aware Off-Policy Evaluation with Linear Function Approximation

Add code
Jun 22, 2021
Figure 1 for Variance-Aware Off-Policy Evaluation with Linear Function Approximation
Figure 2 for Variance-Aware Off-Policy Evaluation with Linear Function Approximation
Figure 3 for Variance-Aware Off-Policy Evaluation with Linear Function Approximation
Figure 4 for Variance-Aware Off-Policy Evaluation with Linear Function Approximation
Viaarxiv icon

Provably Efficient Representation Learning in Low-rank Markov Decision Processes

Add code
Jun 22, 2021
Viaarxiv icon

Uniform-PAC Bounds for Reinforcement Learning with Linear Function Approximation

Add code
Jun 22, 2021
Viaarxiv icon

Batched Neural Bandits

Add code
Feb 25, 2021
Figure 1 for Batched Neural Bandits
Figure 2 for Batched Neural Bandits
Figure 3 for Batched Neural Bandits
Figure 4 for Batched Neural Bandits
Viaarxiv icon