Zhengyuan Zhou

Distributional Soft Actor Critic for Risk Sensitive Learning

Apr 30, 2020
Via arXiv

Sequential Batch Learning in Finite-Action Linear Contextual Bandits

Apr 14, 2020
Via arXiv

Optimal No-regret Learning in Repeated First-price Auctions

Apr 14, 2020
Via arXiv

Diagonal Preconditioning: Theory and Algorithms

Mar 25, 2020
Via arXiv

Finite-Time Last-Iterate Convergence for Multi-Agent Learning in Games

Mar 18, 2020
Via arXiv

Delay-Adaptive Learning in Generalized Linear Contextual Bandits

Mar 11, 2020
Via arXiv

Provably Efficient Reinforcement Learning with Aggregated States

Dec 13, 2019
Via arXiv

Balanced Linear Contextual Bandits

Dec 15, 2018
Via arXiv

Offline Multi-Action Policy Learning: Generalization and Optimization

Oct 10, 2018
Via arXiv

MentorNet: Learning Data-Driven Curriculum for Very Deep Neural Networks on Corrupted Labels

Aug 13, 2018
Via arXiv