Alert button
Picture for Stefanos Leonardos

Stefanos Leonardos

Alert button

AlberDICE: Addressing Out-Of-Distribution Joint Actions in Offline Multi-Agent RL via Alternating Stationary Distribution Correction Estimation

Add code
Bookmark button
Alert button
Nov 03, 2023
Daiki E. Matsunaga, Jongmin Lee, Jaeseok Yoon, Stefanos Leonardos, Pieter Abbeel, Kee-Eung Kim

Viaarxiv icon

Exploration-Exploitation in Multi-Agent Competition: Convergence with Bounded Rationality

Add code
Bookmark button
Alert button
Jun 24, 2021
Stefanos Leonardos, Georgios Piliouras, Kelly Spendlove

Figure 1 for Exploration-Exploitation in Multi-Agent Competition: Convergence with Bounded Rationality
Figure 2 for Exploration-Exploitation in Multi-Agent Competition: Convergence with Bounded Rationality
Figure 3 for Exploration-Exploitation in Multi-Agent Competition: Convergence with Bounded Rationality
Figure 4 for Exploration-Exploitation in Multi-Agent Competition: Convergence with Bounded Rationality
Viaarxiv icon

Global Convergence of Multi-Agent Policy Gradient in Markov Potential Games

Add code
Bookmark button
Alert button
Jun 03, 2021
Stefanos Leonardos, Will Overman, Ioannis Panageas, Georgios Piliouras

Figure 1 for Global Convergence of Multi-Agent Policy Gradient in Markov Potential Games
Figure 2 for Global Convergence of Multi-Agent Policy Gradient in Markov Potential Games
Figure 3 for Global Convergence of Multi-Agent Policy Gradient in Markov Potential Games
Figure 4 for Global Convergence of Multi-Agent Policy Gradient in Markov Potential Games
Viaarxiv icon