Zhengyuan Zhou

Breaking the Lower Bound with (Little) Structure: Acceleration in Non-Convex Stochastic Optimization with Heavy-Tailed Noise

Feb 14, 2023
Zijian Liu, Jiawei Zhang, Zhengyuan Zhou

Near-Optimal High-Probability Convergence for Non-Convex Stochastic Optimization with Variance Reduction

Feb 13, 2023
Zijian Liu, Perry Dong, Srikanth Jagabathula, Zhengyuan Zhou

Single-Trajectory Distributionally Robust Reinforcement Learning

Jan 27, 2023
Zhipeng Liang, Xiaoteng Ma, Jose Blanchet, Jiheng Zhang, Zhengyuan Zhou

Leveraging the Hints: Adaptive Bidding in Repeated First-Price Auctions

Nov 05, 2022
Wei Zhang, Yanjun Han, Zhengyuan Zhou, Aaron Flores, Tsachy Weissman

Distributionally Robust Offline Reinforcement Learning with Linear Function Approximation

Sep 29, 2022
Xiaoteng Ma, Zhipeng Liang, Jose Blanchet, Mingwen Liu, Li Xia, Jiheng Zhang, Qianchuan Zhao, Zhengyuan Zhou

Optimal Diagonal Preconditioning: Theory and Practice

Sep 02, 2022
Zhaonan Qu, Wenzhi Gao, Oliver Hinder, Yinyu Ye, Zhengyuan Zhou

Learning to Order for Inventory Systems with Lost Sales and Uncertain Supplies

Jul 10, 2022
Boxiao Chen, Jiashuo Jiang, Jiawei Zhang, Zhengyuan Zhou

Doubly Robust Distributionally Robust Off-Policy Evaluation and Learning

Feb 19, 2022
Nathan Kallus, Xiaojie Mao, Kaiwen Wang, Zhengyuan Zhou

Optimal No-Regret Learning in Strongly Monotone Games with Bandit Feedback

Dec 08, 2021
Tianyi Lin, Zhengyuan Zhou, Wenjia Ba, Jiawei Zhang
