Picture for Zhengyuan Zhou

Zhengyuan Zhou

Adaptively Learning to Select-Rank in Online Platforms

Add code
Jun 07, 2024
Viaarxiv icon

Simulation-Based Benchmarking of Reinforcement Learning Agents for Personalized Retail Promotions

Add code
May 16, 2024
Viaarxiv icon

On the Last-Iterate Convergence of Shuffling Gradient Methods

Mar 12, 2024
Figure 1 for On the Last-Iterate Convergence of Shuffling Gradient Methods
Figure 2 for On the Last-Iterate Convergence of Shuffling Gradient Methods
Figure 3 for On the Last-Iterate Convergence of Shuffling Gradient Methods
Viaarxiv icon

Stochastic contextual bandits with graph feedback: from independence number to MAS number

Feb 12, 2024
Viaarxiv icon

Revisiting the Last-Iterate Convergence of Stochastic Gradient Methods

Dec 13, 2023
Viaarxiv icon

On the Foundation of Distributionally Robust Reinforcement Learning

Nov 15, 2023
Viaarxiv icon

Adaptive, Doubly Optimal No-Regret Learning in Strongly Monotone and Exp-Concave Games with Gradient Feedback

Oct 24, 2023
Viaarxiv icon

Sample Complexity of Variance-reduced Distributionally Robust Q-learning

May 28, 2023
Figure 1 for Sample Complexity of Variance-reduced Distributionally Robust Q-learning
Figure 2 for Sample Complexity of Variance-reduced Distributionally Robust Q-learning
Figure 3 for Sample Complexity of Variance-reduced Distributionally Robust Q-learning
Figure 4 for Sample Complexity of Variance-reduced Distributionally Robust Q-learning
Viaarxiv icon

Stochastic Nonsmooth Convex Optimization with Heavy-Tailed Noises

Mar 25, 2023
Viaarxiv icon

A Finite Sample Complexity Bound for Distributionally Robust Q-learning

Mar 03, 2023
Figure 1 for A Finite Sample Complexity Bound for Distributionally Robust Q-learning
Figure 2 for A Finite Sample Complexity Bound for Distributionally Robust Q-learning
Figure 3 for A Finite Sample Complexity Bound for Distributionally Robust Q-learning
Figure 4 for A Finite Sample Complexity Bound for Distributionally Robust Q-learning
Viaarxiv icon