Alert button

Scalable Reinforcement Learning Policies for Multi-Agent Control

Nov 16, 2020
Christopher D. Hsu, Heejin Jeong, George J. Pappas, Pratik Chaudhari

Figure 1 for Scalable Reinforcement Learning Policies for Multi-Agent Control
Figure 2 for Scalable Reinforcement Learning Policies for Multi-Agent Control
Figure 3 for Scalable Reinforcement Learning Policies for Multi-Agent Control
Figure 4 for Scalable Reinforcement Learning Policies for Multi-Agent Control

Share this with someone who'll enjoy it:

This paper develops a stochastic Multi-Agent Reinforcement Learning (MARL) method to learn control policies that can handle an arbitrary number of external agents; our policies can be executed for tasks consisting of 1000 pursuers and 1000 evaders. We model pursuers as agents with limited on-board sensing and formulate the problem as a decentralized, partially-observable Markov Decision Process. An attention mechanism is used to build a permutation and input-size invariant embedding of the observations for learning a stochastic policy and value function using techniques in entropy-regularized off-policy methods. Simulation experiments on a large number of problems show that our control policies are dramatically scalable and display cooperative behavior in spite of being executed in a decentralized fashion; our methods offer a simple solution to classical multi-agent problems using techniques in reinforcement learning.

* 8 pages, 10 figures, submitted to RA-L with ICRA option  
View paper onarxiv icon

Share this with someone who'll enjoy it: