Picture for Zaiwei Chen

Zaiwei Chen

Non-Asymptotic Convergence of Stochastic Iterative Algorithms: A Lyapunov Framework

Add code
May 29, 2026
Viaarxiv icon

Achieving $ε^{-2}$ Sample Complexity for Single-Loop Actor-Critic under Minimal Assumptions

Add code
May 13, 2026
Viaarxiv icon

Natural Policy Gradient as Doubly Smoothed Policy Iteration: A Bellman-Operator Framework

Add code
May 11, 2026
Viaarxiv icon

Bridging the Gap Between Average and Discounted TD Learning

Add code
May 03, 2026
Viaarxiv icon

Natural Hypergradient Descent: Algorithm Design, Convergence Analysis, and Parallel Implementation

Add code
Feb 11, 2026
Viaarxiv icon

Achieving $\varepsilon^{-2}$ Dependence for Average-Reward Q-Learning with a New Contraction Principle

Add code
Jan 29, 2026
Viaarxiv icon

Non-Asymptotic Guarantees for Average-Reward Q-Learning with Adaptive Stepsizes

Add code
Apr 25, 2025
Figure 1 for Non-Asymptotic Guarantees for Average-Reward Q-Learning with Adaptive Stepsizes
Figure 2 for Non-Asymptotic Guarantees for Average-Reward Q-Learning with Adaptive Stepsizes
Viaarxiv icon

A Non-Asymptotic Theory of Seminorm Lyapunov Stability: From Deterministic to Stochastic Iterative Algorithms

Add code
Feb 20, 2025
Viaarxiv icon

Last-Iterate Convergence of Payoff-Based Independent Learning in Zero-Sum Stochastic Games

Add code
Sep 02, 2024
Figure 1 for Last-Iterate Convergence of Payoff-Based Independent Learning in Zero-Sum Stochastic Games
Viaarxiv icon

Approximate Global Convergence of Independent Learning in Multi-Agent Systems

Add code
May 30, 2024
Viaarxiv icon