Alert button
Picture for Zhengyuan Zhou

Zhengyuan Zhou

Alert button

Computational Benefits of Intermediate Rewards for Hierarchical Planning

Add code
Bookmark button
Alert button
Jul 08, 2021
Yuexiang Zhai, Christina Baek, Zhengyuan Zhou, Jiantao Jiao, Yi Ma

Figure 1 for Computational Benefits of Intermediate Rewards for Hierarchical Planning
Figure 2 for Computational Benefits of Intermediate Rewards for Hierarchical Planning
Figure 3 for Computational Benefits of Intermediate Rewards for Hierarchical Planning
Figure 4 for Computational Benefits of Intermediate Rewards for Hierarchical Planning
Viaarxiv icon

Distributed stochastic optimization with large delays

Add code
Bookmark button
Alert button
Jul 06, 2021
Zhengyuan Zhou, Panayotis Mertikopoulos, Nicholas Bambos, Peter W. Glynn, Yinyu Ye

Figure 1 for Distributed stochastic optimization with large delays
Figure 2 for Distributed stochastic optimization with large delays
Viaarxiv icon

Policy Learning with Adaptively Collected Data

Add code
Bookmark button
Alert button
May 05, 2021
Ruohan Zhan, Zhimei Ren, Susan Athey, Zhengyuan Zhou

Figure 1 for Policy Learning with Adaptively Collected Data
Figure 2 for Policy Learning with Adaptively Collected Data
Figure 3 for Policy Learning with Adaptively Collected Data
Figure 4 for Policy Learning with Adaptively Collected Data
Viaarxiv icon

Simple Agent, Complex Environment: Efficient Reinforcement Learning with Agent State

Add code
Bookmark button
Alert button
Mar 08, 2021
Shi Dong, Benjamin Van Roy, Zhengyuan Zhou

Figure 1 for Simple Agent, Complex Environment: Efficient Reinforcement Learning with Agent State
Viaarxiv icon

No Discounted-Regret Learning in Adversarial Bandits with Delays

Add code
Bookmark button
Alert button
Mar 08, 2021
Ilai Bistritz, Zhengyuan Zhou, Xi Chen, Nicholas Bambos, Jose Blanchet

Figure 1 for No Discounted-Regret Learning in Adversarial Bandits with Delays
Figure 2 for No Discounted-Regret Learning in Adversarial Bandits with Delays
Figure 3 for No Discounted-Regret Learning in Adversarial Bandits with Delays
Viaarxiv icon

Doubly-Adaptive Thompson Sampling for Multi-Armed and Contextual Bandits

Add code
Bookmark button
Alert button
Feb 25, 2021
Maria Dimakopoulou, Zhimei Ren, Zhengyuan Zhou

Figure 1 for Doubly-Adaptive Thompson Sampling for Multi-Armed and Contextual Bandits
Figure 2 for Doubly-Adaptive Thompson Sampling for Multi-Armed and Contextual Bandits
Figure 3 for Doubly-Adaptive Thompson Sampling for Multi-Armed and Contextual Bandits
Viaarxiv icon

Federated LQR: Learning through Sharing

Add code
Bookmark button
Alert button
Nov 03, 2020
Zhaolin Ren, Aoxiao Zhong, Zhengyuan Zhou, Na Li

Figure 1 for Federated LQR: Learning through Sharing
Figure 2 for Federated LQR: Learning through Sharing
Figure 3 for Federated LQR: Learning through Sharing
Figure 4 for Federated LQR: Learning through Sharing
Viaarxiv icon

Dynamic Batch Learning in High-Dimensional Sparse Linear Contextual Bandits

Add code
Bookmark button
Alert button
Aug 28, 2020
Zhimei Ren, Zhengyuan Zhou

Figure 1 for Dynamic Batch Learning in High-Dimensional Sparse Linear Contextual Bandits
Figure 2 for Dynamic Batch Learning in High-Dimensional Sparse Linear Contextual Bandits
Viaarxiv icon