Zhengyuan Zhou

Finite-Time Last-Iterate Convergence for Multi-Agent Learning in Games

Mar 18, 2020
Tianyi Lin, Zhengyuan Zhou, Panayotis Mertikopoulos, Michael I. Jordan

Multi-action Offline Policy Learning with Bayesian Optimization

Mar 17, 2020
Fang Cai, Zhaonan Qu, Li Xia, Zhengyuan Zhou

Delay-Adaptive Learning in Generalized Linear Contextual Bandits

Mar 11, 2020
Jose Blanchet, Renyuan Xu, Zhengyuan Zhou

Provably Efficient Reinforcement Learning with Aggregated States

Dec 13, 2019
Shi Dong, Benjamin Van Roy, Zhengyuan Zhou

Balanced Linear Contextual Bandits

Dec 15, 2018
Maria Dimakopoulou, Zhengyuan Zhou, Susan Athey, Guido Imbens

Offline Multi-Action Policy Learning: Generalization and Optimization

Oct 10, 2018
Zhengyuan Zhou, Susan Athey, Stefan Wager

MentorNet: Learning Data-Driven Curriculum for Very Deep Neural Networks on Corrupted Labels

Aug 13, 2018
Lu Jiang, Zhengyuan Zhou, Thomas Leung, Li-Jia Li, Li Fei-Fei

On the convergence of mirror descent beyond stochastic convex programming

Jul 16, 2018
Zhengyuan Zhou, Panayotis Mertikopoulos, Nicholas Bambos, Stephen Boyd, Peter Glynn

Learning in games with continuous action sets and unknown payoff functions

Jan 16, 2018
Panayotis Mertikopoulos, Zhengyuan Zhou
