Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Victor Bapst

Sample Efficient Actor-Critic with Experience Replay

Jul 10, 2017

Ziyu Wang, Victor Bapst, Nicolas Heess, Volodymyr Mnih, Remi Munos, Koray Kavukcuoglu, Nando de Freitas

Figure 1 for Sample Efficient Actor-Critic with Experience Replay

Figure 2 for Sample Efficient Actor-Critic with Experience Replay

Figure 3 for Sample Efficient Actor-Critic with Experience Replay

Figure 4 for Sample Efficient Actor-Critic with Experience Replay

Abstract:This paper presents an actor-critic deep reinforcement learning agent with experience replay that is stable, sample efficient, and performs remarkably well on challenging environments, including the discrete 57-game Atari domain and several continuous control problems. To achieve this, the paper introduces several innovations, including truncated importance sampling with bias correction, stochastic dueling network architectures, and a new trust region policy optimization method.

* 20 pages. Prepared for ICLR 2017

Via

Access Paper or Ask Questions