Alert button
Picture for John Schulman

John Schulman

Alert button

Proximal Policy Optimization Algorithms

Aug 28, 2017
John Schulman, Filip Wolski, Prafulla Dhariwal, Alec Radford, Oleg Klimov

Figure 1 for Proximal Policy Optimization Algorithms
Figure 2 for Proximal Policy Optimization Algorithms
Figure 3 for Proximal Policy Optimization Algorithms
Figure 4 for Proximal Policy Optimization Algorithms
Viaarxiv icon

Trust Region Policy Optimization

Apr 20, 2017
John Schulman, Sergey Levine, Philipp Moritz, Michael I. Jordan, Pieter Abbeel

Figure 1 for Trust Region Policy Optimization
Figure 2 for Trust Region Policy Optimization
Viaarxiv icon

Variational Lossy Autoencoder

Mar 04, 2017
Xi Chen, Diederik P. Kingma, Tim Salimans, Yan Duan, Prafulla Dhariwal, John Schulman, Ilya Sutskever, Pieter Abbeel

Figure 1 for Variational Lossy Autoencoder
Figure 2 for Variational Lossy Autoencoder
Figure 3 for Variational Lossy Autoencoder
Figure 4 for Variational Lossy Autoencoder
Viaarxiv icon

VIME: Variational Information Maximizing Exploration

Jan 27, 2017
Rein Houthooft, Xi Chen, Yan Duan, John Schulman, Filip De Turck, Pieter Abbeel

Figure 1 for VIME: Variational Information Maximizing Exploration
Figure 2 for VIME: Variational Information Maximizing Exploration
Figure 3 for VIME: Variational Information Maximizing Exploration
Figure 4 for VIME: Variational Information Maximizing Exploration
Viaarxiv icon

RL$^2$: Fast Reinforcement Learning via Slow Reinforcement Learning

Nov 10, 2016
Yan Duan, John Schulman, Xi Chen, Peter L. Bartlett, Ilya Sutskever, Pieter Abbeel

Figure 1 for RL$^2$: Fast Reinforcement Learning via Slow Reinforcement Learning
Figure 2 for RL$^2$: Fast Reinforcement Learning via Slow Reinforcement Learning
Figure 3 for RL$^2$: Fast Reinforcement Learning via Slow Reinforcement Learning
Figure 4 for RL$^2$: Fast Reinforcement Learning via Slow Reinforcement Learning
Viaarxiv icon

Concrete Problems in AI Safety

Jul 25, 2016
Dario Amodei, Chris Olah, Jacob Steinhardt, Paul Christiano, John Schulman, Dan Mané

Viaarxiv icon

InfoGAN: Interpretable Representation Learning by Information Maximizing Generative Adversarial Nets

Jun 12, 2016
Xi Chen, Yan Duan, Rein Houthooft, John Schulman, Ilya Sutskever, Pieter Abbeel

Figure 1 for InfoGAN: Interpretable Representation Learning by Information Maximizing Generative Adversarial Nets
Figure 2 for InfoGAN: Interpretable Representation Learning by Information Maximizing Generative Adversarial Nets
Figure 3 for InfoGAN: Interpretable Representation Learning by Information Maximizing Generative Adversarial Nets
Figure 4 for InfoGAN: Interpretable Representation Learning by Information Maximizing Generative Adversarial Nets
Viaarxiv icon

OpenAI Gym

Jun 05, 2016
Greg Brockman, Vicki Cheung, Ludwig Pettersson, Jonas Schneider, John Schulman, Jie Tang, Wojciech Zaremba

Figure 1 for OpenAI Gym
Viaarxiv icon

Benchmarking Deep Reinforcement Learning for Continuous Control

May 27, 2016
Yan Duan, Xi Chen, Rein Houthooft, John Schulman, Pieter Abbeel

Figure 1 for Benchmarking Deep Reinforcement Learning for Continuous Control
Figure 2 for Benchmarking Deep Reinforcement Learning for Continuous Control
Figure 3 for Benchmarking Deep Reinforcement Learning for Continuous Control
Figure 4 for Benchmarking Deep Reinforcement Learning for Continuous Control
Viaarxiv icon

Theano: A Python framework for fast computation of mathematical expressions

May 09, 2016
The Theano Development Team, Rami Al-Rfou, Guillaume Alain, Amjad Almahairi, Christof Angermueller, Dzmitry Bahdanau, Nicolas Ballas, Frédéric Bastien, Justin Bayer, Anatoly Belikov, Alexander Belopolsky, Yoshua Bengio, Arnaud Bergeron, James Bergstra, Valentin Bisson, Josh Bleecher Snyder, Nicolas Bouchard, Nicolas Boulanger-Lewandowski, Xavier Bouthillier, Alexandre de Brébisson, Olivier Breuleux, Pierre-Luc Carrier, Kyunghyun Cho, Jan Chorowski, Paul Christiano, Tim Cooijmans, Marc-Alexandre Côté, Myriam Côté, Aaron Courville, Yann N. Dauphin, Olivier Delalleau, Julien Demouth, Guillaume Desjardins, Sander Dieleman, Laurent Dinh, Mélanie Ducoffe, Vincent Dumoulin, Samira Ebrahimi Kahou, Dumitru Erhan, Ziye Fan, Orhan Firat, Mathieu Germain, Xavier Glorot, Ian Goodfellow, Matt Graham, Caglar Gulcehre, Philippe Hamel, Iban Harlouchet, Jean-Philippe Heng, Balázs Hidasi, Sina Honari, Arjun Jain, Sébastien Jean, Kai Jia, Mikhail Korobov, Vivek Kulkarni, Alex Lamb, Pascal Lamblin, Eric Larsen, César Laurent, Sean Lee, Simon Lefrancois, Simon Lemieux, Nicholas Léonard, Zhouhan Lin, Jesse A. Livezey, Cory Lorenz, Jeremiah Lowin, Qianli Ma, Pierre-Antoine Manzagol, Olivier Mastropietro, Robert T. McGibbon, Roland Memisevic, Bart van Merriënboer, Vincent Michalski, Mehdi Mirza, Alberto Orlandi, Christopher Pal, Razvan Pascanu, Mohammad Pezeshki, Colin Raffel, Daniel Renshaw, Matthew Rocklin, Adriana Romero, Markus Roth, Peter Sadowski, John Salvatier, François Savard, Jan Schlüter, John Schulman, Gabriel Schwartz, Iulian Vlad Serban, Dmitriy Serdyuk, Samira Shabanian, Étienne Simon, Sigurd Spieckermann, S. Ramana Subramanyam, Jakub Sygnowski, Jérémie Tanguay, Gijs van Tulder, Joseph Turian, Sebastian Urban, Pascal Vincent, Francesco Visin, Harm de Vries, David Warde-Farley, Dustin J. Webb, Matthew Willson, Kelvin Xu, Lijun Xue, Li Yao, Saizheng Zhang, Ying Zhang

Figure 1 for Theano: A Python framework for fast computation of mathematical expressions
Figure 2 for Theano: A Python framework for fast computation of mathematical expressions
Figure 3 for Theano: A Python framework for fast computation of mathematical expressions
Figure 4 for Theano: A Python framework for fast computation of mathematical expressions
Viaarxiv icon