Alert button
Picture for Thinh T. Doan

Thinh T. Doan

Alert button

Fast Nonlinear Two-Time-Scale Stochastic Approximation: Achieving $O(1/k)$ Finite-Sample Complexity

Add code
Bookmark button
Alert button
Jan 24, 2024
Thinh T. Doan

Viaarxiv icon

Connected Superlevel Set in (Deep) Reinforcement Learning and its Application to Minimax Theorems

Add code
Bookmark button
Alert button
Mar 23, 2023
Sihan Zeng, Thinh T. Doan, Justin Romberg

Figure 1 for Connected Superlevel Set in (Deep) Reinforcement Learning and its Application to Minimax Theorems
Figure 2 for Connected Superlevel Set in (Deep) Reinforcement Learning and its Application to Minimax Theorems
Viaarxiv icon

Convergence and Price of Anarchy Guarantees of the Softmax Policy Gradient in Markov Potential Games

Add code
Bookmark button
Alert button
Jun 15, 2022
Dingyang Chen, Qi Zhang, Thinh T. Doan

Figure 1 for Convergence and Price of Anarchy Guarantees of the Softmax Policy Gradient in Markov Potential Games
Figure 2 for Convergence and Price of Anarchy Guarantees of the Softmax Policy Gradient in Markov Potential Games
Figure 3 for Convergence and Price of Anarchy Guarantees of the Softmax Policy Gradient in Markov Potential Games
Figure 4 for Convergence and Price of Anarchy Guarantees of the Softmax Policy Gradient in Markov Potential Games
Viaarxiv icon

Regularized Gradient Descent Ascent for Two-Player Zero-Sum Markov Games

Add code
Bookmark button
Alert button
May 27, 2022
Sihan Zeng, Thinh T. Doan, Justin Romberg

Figure 1 for Regularized Gradient Descent Ascent for Two-Player Zero-Sum Markov Games
Figure 2 for Regularized Gradient Descent Ascent for Two-Player Zero-Sum Markov Games
Viaarxiv icon

Convergence Rates of Two-Time-Scale Gradient Descent-Ascent Dynamics for Solving Nonconvex Min-Max Problems

Add code
Bookmark button
Alert button
Dec 17, 2021
Thinh T. Doan

Figure 1 for Convergence Rates of Two-Time-Scale Gradient Descent-Ascent Dynamics for Solving Nonconvex Min-Max Problems
Viaarxiv icon

Finite-Time Complexity of Online Primal-Dual Natural Actor-Critic Algorithm for Constrained Markov Decision Processes

Add code
Bookmark button
Alert button
Oct 21, 2021
Sihan Zeng, Thinh T. Doan, Justin Romberg

Figure 1 for Finite-Time Complexity of Online Primal-Dual Natural Actor-Critic Algorithm for Constrained Markov Decision Processes
Figure 2 for Finite-Time Complexity of Online Primal-Dual Natural Actor-Critic Algorithm for Constrained Markov Decision Processes
Viaarxiv icon

A Two-Time-Scale Stochastic Optimization Framework with Applications in Control and Reinforcement Learning

Add code
Bookmark button
Alert button
Oct 01, 2021
Sihan Zeng, Thinh T. Doan, Justin Romberg

Figure 1 for A Two-Time-Scale Stochastic Optimization Framework with Applications in Control and Reinforcement Learning
Figure 2 for A Two-Time-Scale Stochastic Optimization Framework with Applications in Control and Reinforcement Learning
Viaarxiv icon

Byzantine Fault-Tolerance in Federated Local SGD under 2f-Redundancy

Add code
Bookmark button
Alert button
Aug 26, 2021
Nirupam Gupta, Thinh T. Doan, Nitin Vaidya

Figure 1 for Byzantine Fault-Tolerance in Federated Local SGD under 2f-Redundancy
Figure 2 for Byzantine Fault-Tolerance in Federated Local SGD under 2f-Redundancy
Viaarxiv icon

Finite-Time Convergence Rates of Nonlinear Two-Time-Scale Stochastic Approximation under Markovian Noise

Add code
Bookmark button
Alert button
Apr 04, 2021
Thinh T. Doan

Viaarxiv icon