Alert button
Picture for Zhaoran Wang

Zhaoran Wang

Alert button

Gap-Dependent Bounds for Two-Player Markov Games

Add code
Bookmark button
Alert button
Jul 01, 2021
Zehao Dou, Zhuoran Yang, Zhaoran Wang, Simon S. Du

Viaarxiv icon

Randomized Exploration for Reinforcement Learning with General Value Function Approximation

Add code
Bookmark button
Alert button
Jun 15, 2021
Haque Ishfaq, Qiwen Cui, Viet Nguyen, Alex Ayoub, Zhuoran Yang, Zhaoran Wang, Doina Precup, Lin F. Yang

Figure 1 for Randomized Exploration for Reinforcement Learning with General Value Function Approximation
Figure 2 for Randomized Exploration for Reinforcement Learning with General Value Function Approximation
Figure 3 for Randomized Exploration for Reinforcement Learning with General Value Function Approximation
Figure 4 for Randomized Exploration for Reinforcement Learning with General Value Function Approximation
Viaarxiv icon

Permutation Invariant Policy Optimization for Mean-Field Multi-Agent Reinforcement Learning: A Principled Approach

Add code
Bookmark button
Alert button
May 18, 2021
Yan Li, Lingxiao Wang, Jiachen Yang, Ethan Wang, Zhaoran Wang, Tuo Zhao, Hongyuan Zha

Figure 1 for Permutation Invariant Policy Optimization for Mean-Field Multi-Agent Reinforcement Learning: A Principled Approach
Figure 2 for Permutation Invariant Policy Optimization for Mean-Field Multi-Agent Reinforcement Learning: A Principled Approach
Figure 3 for Permutation Invariant Policy Optimization for Mean-Field Multi-Agent Reinforcement Learning: A Principled Approach
Figure 4 for Permutation Invariant Policy Optimization for Mean-Field Multi-Agent Reinforcement Learning: A Principled Approach
Viaarxiv icon

Principled Exploration via Optimistic Bootstrapping and Backward Induction

Add code
Bookmark button
Alert button
May 17, 2021
Chenjia Bai, Lingxiao Wang, Lei Han, Jianye Hao, Animesh Garg, Peng Liu, Zhaoran Wang

Figure 1 for Principled Exploration via Optimistic Bootstrapping and Backward Induction
Figure 2 for Principled Exploration via Optimistic Bootstrapping and Backward Induction
Figure 3 for Principled Exploration via Optimistic Bootstrapping and Backward Induction
Figure 4 for Principled Exploration via Optimistic Bootstrapping and Backward Induction
Viaarxiv icon

Doubly Robust Off-Policy Actor-Critic: Convergence and Optimality

Add code
Bookmark button
Alert button
Feb 27, 2021
Tengyu Xu, Zhuoran Yang, Zhaoran Wang, Yingbin Liang

Figure 1 for Doubly Robust Off-Policy Actor-Critic: Convergence and Optimality
Figure 2 for Doubly Robust Off-Policy Actor-Critic: Convergence and Optimality
Figure 3 for Doubly Robust Off-Policy Actor-Critic: Convergence and Optimality
Viaarxiv icon

Instrumental Variable Value Iteration for Causal Offline Reinforcement Learning

Add code
Bookmark button
Alert button
Feb 19, 2021
Luofeng Liao, Zuyue Fu, Zhuoran Yang, Mladen Kolar, Zhaoran Wang

Figure 1 for Instrumental Variable Value Iteration for Causal Offline Reinforcement Learning
Figure 2 for Instrumental Variable Value Iteration for Causal Offline Reinforcement Learning
Viaarxiv icon

A Momentum-Assisted Single-Timescale Stochastic Approximation Algorithm for Bilevel Optimization

Add code
Bookmark button
Alert button
Feb 15, 2021
Prashant Khanduri, Siliang Zeng, Mingyi Hong, Hoi-To Wai, Zhaoran Wang, Zhuoran Yang

Figure 1 for A Momentum-Assisted Single-Timescale Stochastic Approximation Algorithm for Bilevel Optimization
Figure 2 for A Momentum-Assisted Single-Timescale Stochastic Approximation Algorithm for Bilevel Optimization
Figure 3 for A Momentum-Assisted Single-Timescale Stochastic Approximation Algorithm for Bilevel Optimization
Figure 4 for A Momentum-Assisted Single-Timescale Stochastic Approximation Algorithm for Bilevel Optimization
Viaarxiv icon

Provably Training Neural Network Classifiers under Fairness Constraints

Add code
Bookmark button
Alert button
Dec 30, 2020
You-Lin Chen, Zhaoran Wang, Mladen Kolar

Figure 1 for Provably Training Neural Network Classifiers under Fairness Constraints
Figure 2 for Provably Training Neural Network Classifiers under Fairness Constraints
Viaarxiv icon