Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Sanjeevan Ahilan

A Succinct Summary of Reinforcement Learning

Jan 03, 2023

Sanjeevan Ahilan

Abstract:This document is a concise summary of many key results in single-agent reinforcement learning (RL). The intended audience are those who already have some familiarity with RL and are looking to review, reference and/or remind themselves of important ideas in the field.

Via

Access Paper or Ask Questions

SMARTS: Scalable Multi-Agent Reinforcement Learning Training School for Autonomous Driving

Nov 01, 2020

Ming Zhou, Jun Luo, Julian Villella, Yaodong Yang, David Rusu, Jiayu Miao, Weinan Zhang, Montgomery Alban, Iman Fadakar, Zheng Chen(+27 more)

Figure 1 for SMARTS: Scalable Multi-Agent Reinforcement Learning Training School for Autonomous Driving

Figure 2 for SMARTS: Scalable Multi-Agent Reinforcement Learning Training School for Autonomous Driving

Figure 3 for SMARTS: Scalable Multi-Agent Reinforcement Learning Training School for Autonomous Driving

Figure 4 for SMARTS: Scalable Multi-Agent Reinforcement Learning Training School for Autonomous Driving

Abstract:Multi-agent interaction is a fundamental aspect of autonomous driving in the real world. Despite more than a decade of research and development, the problem of how to competently interact with diverse road users in diverse scenarios remains largely unsolved. Learning methods have much to offer towards solving this problem. But they require a realistic multi-agent simulator that generates diverse and competent driving interactions. To meet this need, we develop a dedicated simulation platform called SMARTS (Scalable Multi-Agent RL Training School). SMARTS supports the training, accumulation, and use of diverse behavior models of road users. These are in turn used to create increasingly more realistic and diverse interactions that enable deeper and broader research on multi-agent interaction. In this paper, we describe the design goals of SMARTS, explain its basic architecture and its key features, and illustrate its use through concrete multi-agent experiments on interactive scenarios. We open-source the SMARTS platform and the associated benchmark tasks and evaluation metrics to encourage and empower research on multi-agent learning for autonomous driving. Our code is available at https://github.com/huawei-noah/SMARTS.

* 20 pages, 11 figures. Paper accepted to CoRL 2020

Via

Access Paper or Ask Questions

Correcting Experience Replay for Multi-Agent Communication

Oct 02, 2020

Sanjeevan Ahilan, Peter Dayan

Figure 1 for Correcting Experience Replay for Multi-Agent Communication

Figure 2 for Correcting Experience Replay for Multi-Agent Communication

Figure 3 for Correcting Experience Replay for Multi-Agent Communication

Figure 4 for Correcting Experience Replay for Multi-Agent Communication

Abstract:We consider the problem of learning to communicate using multi-agent reinforcement learning (MARL). A common approach is to learn off-policy, using data sampled from a replay buffer. However, messages received in the past may not accurately reflect the current communication policy of each agent, and this complicates learning. We therefore introduce a 'communication correction' which accounts for the non-stationarity of observed communication induced by multi-agent learning. It works by relabelling the received message to make it likely under the communicator's current policy, and thus be a better reflection of the receiver's current environment. To account for cases in which agents are both senders and receivers, we introduce an ordered relabelling scheme. Our correction is computationally efficient and can be integrated with a range of off-policy algorithms. It substantially improves the ability of communicating MARL systems to learn across a variety of cooperative and competitive tasks.

Via

Access Paper or Ask Questions

Feudal Multi-Agent Hierarchies for Cooperative Reinforcement Learning

Jan 24, 2019

Sanjeevan Ahilan, Peter Dayan

Figure 1 for Feudal Multi-Agent Hierarchies for Cooperative Reinforcement Learning

Figure 2 for Feudal Multi-Agent Hierarchies for Cooperative Reinforcement Learning

Figure 3 for Feudal Multi-Agent Hierarchies for Cooperative Reinforcement Learning

Figure 4 for Feudal Multi-Agent Hierarchies for Cooperative Reinforcement Learning

Abstract:We investigate how reinforcement learning agents can learn to cooperate. Drawing inspiration from human societies, in which successful coordination of many individuals is often facilitated by hierarchical organisation, we introduce Feudal Multi-agent Hierarchies (FMH). In this framework, a 'manager' agent, which is tasked with maximising the environmentally-determined reward function, learns to communicate subgoals to multiple, simultaneously-operating, 'worker' agents. Workers, which are rewarded for achieving managerial subgoals, take concurrent actions in the world. We outline the structure of FMH and demonstrate its potential for decentralised learning and control. We find that, given an adequate set of subgoals from which to choose, FMH performs, and particularly scales, substantially better than cooperative approaches that use a shared reward function.

Via

Access Paper or Ask Questions