Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:UneVEn: Universal Value Exploration for Multi-Agent Reinforcement Learning

Oct 06, 2020

Tarun Gupta, Anuj Mahajan, Bei Peng, Wendelin Böhmer, Shimon Whiteson

Figure 1 for UneVEn: Universal Value Exploration for Multi-Agent Reinforcement Learning

Figure 2 for UneVEn: Universal Value Exploration for Multi-Agent Reinforcement Learning

Figure 3 for UneVEn: Universal Value Exploration for Multi-Agent Reinforcement Learning

Figure 4 for UneVEn: Universal Value Exploration for Multi-Agent Reinforcement Learning

Share this with someone who'll enjoy it:

Abstract:This paper focuses on cooperative value-based multi-agent reinforcement learning (MARL) in the paradigm of centralized training with decentralized execution (CTDE). Current state-of-the-art value-based MARL methods leverage CTDE to learn a centralized joint-action value function as a monotonic mixing of each agent's utility function, which enables easy decentralization. However, this monotonic restriction leads to inefficient exploration in tasks with nonmonotonic returns due to suboptimal approximations of the values of joint actions. To address this, we present a novel MARL approach called Universal Value Exploration (UneVEn), which uses universal successor features (USFs) to learn policies of tasks related to the target task, but with simpler reward functions in a sample efficient manner. UneVEn uses novel action-selection schemes between randomly sampled related tasks during exploration, which enables the monotonic joint-action value function of the target task to place more importance on useful joint actions. Empirical results on a challenging cooperative predator-prey task requiring significant coordination amongst agents show that UneVEn significantly outperforms state-of-the-art baselines.

* Under review

View paper on

OpenReview

Share this with someone who'll enjoy it:

Title:UneVEn: Universal Value Exploration for Multi-Agent Reinforcement Learning

Paper and Code