Title: Enhancing Cooperative Multi-Agent Reinforcement Learning with State Modelling and Adversarial Exploration

May 08, 2025

Andreas Kontogiannis, Konstantinos Papathanasiou, Yi Shen, Giorgos Stamou, Michael M. Zavlanos, George Vouros

Abstract: Learning to cooperate in distributed, partially observable environments without communication poses significant challenges for multi-agent deep reinforcement learning (MARL). This paper addresses key concerns in this setting, focusing on inferring state representations from individual agent observations and leveraging these representations to improve agents' exploration and collaborative task execution policies. To this end, we propose a novel state modelling framework for cooperative MARL in which agents infer meaningful belief representations of the non-observable state with respect to optimizing their own policies, while filtering out redundant and less informative joint state information. Building on this framework, we propose the MARL algorithm SMPE. In SMPE, agents enhance the discriminative abilities of their own policies under partial observability in two ways: explicitly, by incorporating their beliefs into the policy network, and implicitly, by adopting adversarial exploration policies that encourage agents to discover novel, high-value states while improving the discriminative abilities of others. Experimentally, we show that SMPE outperforms state-of-the-art MARL algorithms in complex, fully cooperative tasks from the MPE, LBF, and RWARE benchmarks.
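As a rough illustration of what "incorporating beliefs into the policy network" can look like, the sketch below encodes an agent's local observation into a belief vector and feeds the observation together with that belief into a softmax policy. This is not the SMPE implementation: the class name `BeliefConditionedPolicy`, the single-layer encoder, the dimensions, and the random initialization are all hypothetical choices made for the example. The training of the belief model and the adversarial exploration scheme described in the abstract are omitted.

```python
import math
import random


def linear(weights, bias, x):
    """Dense layer: `weights` holds one row of input weights per output unit."""
    return [sum(w * v for w, v in zip(row, x)) + b for row, b in zip(weights, bias)]


def softmax(logits):
    """Numerically stable softmax over a list of floats."""
    m = max(logits)
    exps = [math.exp(z - m) for z in logits]
    total = sum(exps)
    return [e / total for e in exps]


def init_layer(rng, n_out, n_in):
    """Small random weights and zero biases (illustrative initialization only)."""
    weights = [[rng.uniform(-0.1, 0.1) for _ in range(n_in)] for _ in range(n_out)]
    return weights, [0.0] * n_out


class BeliefConditionedPolicy:
    """Hypothetical agent policy conditioned on its observation and a belief vector."""

    def __init__(self, obs_dim, belief_dim, n_actions, seed=0):
        rng = random.Random(seed)
        # Assumed encoder: maps the local observation to a belief embedding.
        self.enc_w, self.enc_b = init_layer(rng, belief_dim, obs_dim)
        # Policy head: takes the observation concatenated with the belief.
        self.pi_w, self.pi_b = init_layer(rng, n_actions, obs_dim + belief_dim)

    def belief(self, obs):
        """Belief embedding squashed into [-1, 1] with tanh."""
        return [math.tanh(h) for h in linear(self.enc_w, self.enc_b, obs)]

    def action_probs(self, obs):
        """Action distribution computed from [observation, belief]."""
        z = self.belief(obs)
        return softmax(linear(self.pi_w, self.pi_b, list(obs) + z))


policy = BeliefConditionedPolicy(obs_dim=4, belief_dim=3, n_actions=5)
probs = policy.action_probs([0.2, -0.1, 0.5, 0.0])
```

Concatenating the belief with the raw observation is only one simple way to condition a policy on an inferred representation; the paper may combine them differently.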

* Accepted (Poster) at ICML 2025
