Xuedong Shang

Price of Safety in Linear Best Arm Identification

Sep 15, 2023

UCB Momentum Q-learning: Correcting the bias without forgetting

Mar 01, 2021

Stochastic Bandits with Vector Losses: Minimizing $\ell^\infty$-Norm of Relative Losses

Oct 15, 2020

Gamification of Pure Exploration for Linear Bandits

Jul 02, 2020

Fixed-Confidence Guarantees for Bayesian Best-Arm Identification

Oct 28, 2019