Picture for Zhengyuan Zhou

Zhengyuan Zhou

Distributed stochastic optimization with large delays

Add code
Jul 06, 2021
Figure 1 for Distributed stochastic optimization with large delays
Figure 2 for Distributed stochastic optimization with large delays
Viaarxiv icon

Policy Learning with Adaptively Collected Data

Add code
May 05, 2021
Figure 1 for Policy Learning with Adaptively Collected Data
Figure 2 for Policy Learning with Adaptively Collected Data
Figure 3 for Policy Learning with Adaptively Collected Data
Figure 4 for Policy Learning with Adaptively Collected Data
Viaarxiv icon

Simple Agent, Complex Environment: Efficient Reinforcement Learning with Agent State

Add code
Mar 08, 2021
Figure 1 for Simple Agent, Complex Environment: Efficient Reinforcement Learning with Agent State
Viaarxiv icon

No Discounted-Regret Learning in Adversarial Bandits with Delays

Add code
Mar 08, 2021
Figure 1 for No Discounted-Regret Learning in Adversarial Bandits with Delays
Figure 2 for No Discounted-Regret Learning in Adversarial Bandits with Delays
Figure 3 for No Discounted-Regret Learning in Adversarial Bandits with Delays
Viaarxiv icon

Doubly-Adaptive Thompson Sampling for Multi-Armed and Contextual Bandits

Add code
Feb 25, 2021
Figure 1 for Doubly-Adaptive Thompson Sampling for Multi-Armed and Contextual Bandits
Figure 2 for Doubly-Adaptive Thompson Sampling for Multi-Armed and Contextual Bandits
Figure 3 for Doubly-Adaptive Thompson Sampling for Multi-Armed and Contextual Bandits
Viaarxiv icon

Federated LQR: Learning through Sharing

Add code
Nov 03, 2020
Figure 1 for Federated LQR: Learning through Sharing
Figure 2 for Federated LQR: Learning through Sharing
Figure 3 for Federated LQR: Learning through Sharing
Figure 4 for Federated LQR: Learning through Sharing
Viaarxiv icon

Dynamic Batch Learning in High-Dimensional Sparse Linear Contextual Bandits

Add code
Aug 28, 2020
Figure 1 for Dynamic Batch Learning in High-Dimensional Sparse Linear Contextual Bandits
Figure 2 for Dynamic Batch Learning in High-Dimensional Sparse Linear Contextual Bandits
Viaarxiv icon

Federated Learning's Blessing: FedAvg has Linear Speedup

Add code
Jul 11, 2020
Figure 1 for Federated Learning's Blessing: FedAvg has Linear Speedup
Figure 2 for Federated Learning's Blessing: FedAvg has Linear Speedup
Figure 3 for Federated Learning's Blessing: FedAvg has Linear Speedup
Figure 4 for Federated Learning's Blessing: FedAvg has Linear Speedup
Viaarxiv icon

Learning to Bid Optimally and Efficiently in Adversarial First-price Auctions

Add code
Jul 09, 2020
Figure 1 for Learning to Bid Optimally and Efficiently in Adversarial First-price Auctions
Figure 2 for Learning to Bid Optimally and Efficiently in Adversarial First-price Auctions
Figure 3 for Learning to Bid Optimally and Efficiently in Adversarial First-price Auctions
Figure 4 for Learning to Bid Optimally and Efficiently in Adversarial First-price Auctions
Viaarxiv icon

Distributional Robust Batch Contextual Bandits

Add code
Jun 10, 2020
Figure 1 for Distributional Robust Batch Contextual Bandits
Figure 2 for Distributional Robust Batch Contextual Bandits
Figure 3 for Distributional Robust Batch Contextual Bandits
Figure 4 for Distributional Robust Batch Contextual Bandits
Viaarxiv icon