Picture for Zhengyuan Zhou

Zhengyuan Zhou

Breaking the Lower Bound with (Little) Structure: Acceleration in Non-Convex Stochastic Optimization with Heavy-Tailed Noise

Add code
Feb 14, 2023
Viaarxiv icon

Near-Optimal High-Probability Convergence for Non-Convex Stochastic Optimization with Variance Reduction

Add code
Feb 13, 2023
Viaarxiv icon

Single-Trajectory Distributionally Robust Reinforcement Learning

Add code
Jan 27, 2023
Figure 1 for Single-Trajectory Distributionally Robust Reinforcement Learning
Figure 2 for Single-Trajectory Distributionally Robust Reinforcement Learning
Figure 3 for Single-Trajectory Distributionally Robust Reinforcement Learning
Figure 4 for Single-Trajectory Distributionally Robust Reinforcement Learning
Viaarxiv icon

Leveraging the Hints: Adaptive Bidding in Repeated First-Price Auctions

Add code
Nov 05, 2022
Figure 1 for Leveraging the Hints: Adaptive Bidding in Repeated First-Price Auctions
Figure 2 for Leveraging the Hints: Adaptive Bidding in Repeated First-Price Auctions
Figure 3 for Leveraging the Hints: Adaptive Bidding in Repeated First-Price Auctions
Figure 4 for Leveraging the Hints: Adaptive Bidding in Repeated First-Price Auctions
Viaarxiv icon

Distributionally Robust Offline Reinforcement Learning with Linear Function Approximation

Add code
Sep 29, 2022
Figure 1 for Distributionally Robust Offline Reinforcement Learning with Linear Function Approximation
Figure 2 for Distributionally Robust Offline Reinforcement Learning with Linear Function Approximation
Viaarxiv icon

Optimal Diagonal Preconditioning: Theory and Practice

Add code
Sep 02, 2022
Figure 1 for Optimal Diagonal Preconditioning: Theory and Practice
Figure 2 for Optimal Diagonal Preconditioning: Theory and Practice
Figure 3 for Optimal Diagonal Preconditioning: Theory and Practice
Figure 4 for Optimal Diagonal Preconditioning: Theory and Practice
Viaarxiv icon

Learning to Order for Inventory Systems with Lost Sales and Uncertain Supplies

Add code
Jul 10, 2022
Viaarxiv icon

Doubly Robust Distributionally Robust Off-Policy Evaluation and Learning

Add code
Feb 19, 2022
Figure 1 for Doubly Robust Distributionally Robust Off-Policy Evaluation and Learning
Figure 2 for Doubly Robust Distributionally Robust Off-Policy Evaluation and Learning
Figure 3 for Doubly Robust Distributionally Robust Off-Policy Evaluation and Learning
Viaarxiv icon

Optimal No-Regret Learning in Strongly Monotone Games with Bandit Feedback

Add code
Dec 08, 2021
Figure 1 for Optimal No-Regret Learning in Strongly Monotone Games with Bandit Feedback
Figure 2 for Optimal No-Regret Learning in Strongly Monotone Games with Bandit Feedback
Figure 3 for Optimal No-Regret Learning in Strongly Monotone Games with Bandit Feedback
Figure 4 for Optimal No-Regret Learning in Strongly Monotone Games with Bandit Feedback
Viaarxiv icon

Computational Benefits of Intermediate Rewards for Hierarchical Planning

Add code
Jul 08, 2021
Figure 1 for Computational Benefits of Intermediate Rewards for Hierarchical Planning
Figure 2 for Computational Benefits of Intermediate Rewards for Hierarchical Planning
Figure 3 for Computational Benefits of Intermediate Rewards for Hierarchical Planning
Figure 4 for Computational Benefits of Intermediate Rewards for Hierarchical Planning
Viaarxiv icon