Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:Gradient Play in Multi-Agent Markov Stochastic Games: Stationary Points and Convergence

Jun 17, 2021

Runyu Zhang, Zhaolin Ren, Na Li

Figure 1 for Gradient Play in Multi-Agent Markov Stochastic Games: Stationary Points and Convergence

Figure 2 for Gradient Play in Multi-Agent Markov Stochastic Games: Stationary Points and Convergence

Share this with someone who'll enjoy it:

Abstract:We study the performance of the gradient play algorithm for multi-agent tabular Markov decision processes (MDPs), which are also known as stochastic games (SGs), where each agent tries to maximize its own total discounted reward by making decisions independently based on current state information which is shared between agents. Policies are directly parameterized by the probability of choosing a certain action at a given state. We show that Nash equilibria (NEs) and first order stationary policies are equivalent in this setting, and give a non-asymptotic global convergence rate analysis to an $\epsilon$-NE for a subclass of multi-agent MDPs called Markov potential games, which includes the cooperative setting with identical rewards among agents as an important special case. Our result shows that the number of iterations to reach an $\epsilon$-NE scales linearly, instead of exponentially, with the number of agents. Local geometry and local stability are also considered. For Markov potential games, we prove that strict NEs are local maxima of the total potential function and fully-mixed NEs are saddle points. We also give a local convergence rate around strict NEs for more general settings.

View paper on

Share this with someone who'll enjoy it:

Title:Gradient Play in Multi-Agent Markov Stochastic Games: Stationary Points and Convergence

Paper and Code