Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:Backplay: "Man muss immer umkehren"

Sep 28, 2018

Cinjon Resnick, Roberta Raileanu, Sanyam Kapoor, Alexander Peysakhovich, Kyunghyun Cho, Joan Bruna

Figure 1 for Backplay: "Man muss immer umkehren"

Figure 2 for Backplay: "Man muss immer umkehren"

Figure 3 for Backplay: "Man muss immer umkehren"

Figure 4 for Backplay: "Man muss immer umkehren"

Share this with someone who'll enjoy it:

Abstract:A long-standing problem in model-free reinforcement learning (RL) is that it requires a large number of trials to learn a good policy, especially in environments with sparse rewards. We explore a method to increase the sample efficiency of RL when we have access to demonstrations. Our approach, Backplay, uses a single demonstration to construct a curriculum for a given task. Rather than starting each training episode in the environment's fixed initial state, we start the agent near the end of the demonstration and move the starting point backwards during the course of training until we reach the initial state. We perform experiments in a competitive four-player game (Pommerman) and a path-finding maze game. We find that Backplay provides significant gains in sample complexity with a stark advantage in sparse reward settings. In some cases, it reached success rates greater than 50 and generalized to unseen initial conditions, while standard RL did not yield any improvement.

View paper on

Share this with someone who'll enjoy it:

Title:Backplay: "Man muss immer umkehren"

Paper and Code