Zaiwei Chen

Two-Timescale Q-Learning with Function Approximation in Zero-Sum Stochastic Games

Dec 08, 2023
Zaiwei Chen, Kaiqing Zhang, Eric Mazumdar, Asuman Ozdaglar, Adam Wierman

Concentration of Contractive Stochastic Approximation: Additive and Multiplicative Noise

Mar 28, 2023
Zaiwei Chen, Siva Theja Maguluri, Martin Zubeldia

Convergence Rates for Localized Actor-Critic in Networked Markov Potential Games

Mar 08, 2023
Zhaoyi Zhou, Zaiwei Chen, Yiheng Lin, Adam Wierman

A Finite-Sample Analysis of Payoff-Based Independent Learning in Zero-Sum Stochastic Games

Mar 03, 2023
Zaiwei Chen, Kaiqing Zhang, Eric Mazumdar, Asuman Ozdaglar, Adam Wierman

Global Convergence of Localized Policy Iteration in Networked Multi-Agent Reinforcement Learning

Nov 30, 2022
Yizhou Zhang, Guannan Qu, Pan Xu, Yiheng Lin, Zaiwei Chen, Adam Wierman

Sample Complexity of Policy-Based Methods under Off-Policy Sampling and Linear Function Approximation

Aug 05, 2022
Zaiwei Chen, Siva Theja Maguluri

Target Network and Truncation Overcome the Deadly Triad in $Q$-Learning

Mar 05, 2022
Zaiwei Chen, John Paul Clarke, Siva Theja Maguluri

Stationary Behavior of Constant Stepsize SGD Type Algorithms: An Asymptotic Characterization

Nov 11, 2021
Zaiwei Chen, Shancong Mou, Siva Theja Maguluri

Finite-Sample Analysis of Off-Policy TD-Learning via Generalized Bellman Operators

Jun 24, 2021
Zaiwei Chen, Siva Theja Maguluri, Sanjay Shakkottai, Karthikeyan Shanmugam

Finite-Sample Analysis of Off-Policy Natural Actor-Critic with Linear Function Approximation

May 26, 2021
Zaiwei Chen, Sajad Khodadadian, Siva Theja Maguluri
