Paul Christiano

Fine-Tuning Language Models from Human Preferences

Sep 18, 2019
Daniel M. Ziegler, Nisan Stiennon, Jeffrey Wu, Tom B. Brown, Alec Radford, Dario Amodei, Paul Christiano, Geoffrey Irving

AI safety via debate

Oct 22, 2018
Geoffrey Irving, Paul Christiano, Dario Amodei

Supervising strong learners by amplifying weak experts

Oct 19, 2018
Paul Christiano, Buck Shlegeris, Dario Amodei

Unrestricted Adversarial Examples

Sep 22, 2018
Tom B. Brown, Nicholas Carlini, Chiyuan Zhang, Catherine Olsson, Paul Christiano, Ian Goodfellow

Deep reinforcement learning from human preferences

Jul 13, 2017
Paul Christiano, Jan Leike, Tom B. Brown, Miljan Martic, Shane Legg, Dario Amodei

A Connection between Generative Adversarial Networks, Inverse Reinforcement Learning, and Energy-Based Models

Nov 25, 2016
Chelsea Finn, Paul Christiano, Pieter Abbeel, Sergey Levine

Transfer from Simulation to Real World through Learning Deep Inverse Dynamics Model

Oct 11, 2016
Paul Christiano, Zain Shah, Igor Mordatch, Jonas Schneider, Trevor Blackwell, Joshua Tobin, Pieter Abbeel, Wojciech Zaremba

Concrete Problems in AI Safety

Jul 25, 2016
Dario Amodei, Chris Olah, Jacob Steinhardt, Paul Christiano, John Schulman, Dan Mané

Theano: A Python framework for fast computation of mathematical expressions

May 09, 2016
The Theano Development Team, Rami Al-Rfou, Guillaume Alain, Amjad Almahairi, Christof Angermueller, Dzmitry Bahdanau, Nicolas Ballas, Frédéric Bastien, Justin Bayer, Anatoly Belikov, Alexander Belopolsky, Yoshua Bengio, Arnaud Bergeron, James Bergstra, Valentin Bisson, Josh Bleecher Snyder, Nicolas Bouchard, Nicolas Boulanger-Lewandowski, Xavier Bouthillier, Alexandre de Brébisson, Olivier Breuleux, Pierre-Luc Carrier, Kyunghyun Cho, Jan Chorowski, Paul Christiano, Tim Cooijmans, Marc-Alexandre Côté, Myriam Côté, Aaron Courville, Yann N. Dauphin, Olivier Delalleau, Julien Demouth, Guillaume Desjardins, Sander Dieleman, Laurent Dinh, Mélanie Ducoffe, Vincent Dumoulin, Samira Ebrahimi Kahou, Dumitru Erhan, Ziye Fan, Orhan Firat, Mathieu Germain, Xavier Glorot, Ian Goodfellow, Matt Graham, Caglar Gulcehre, Philippe Hamel, Iban Harlouchet, Jean-Philippe Heng, Balázs Hidasi, Sina Honari, Arjun Jain, Sébastien Jean, Kai Jia, Mikhail Korobov, Vivek Kulkarni, Alex Lamb, Pascal Lamblin, Eric Larsen, César Laurent, Sean Lee, Simon Lefrancois, Simon Lemieux, Nicholas Léonard, Zhouhan Lin, Jesse A. Livezey, Cory Lorenz, Jeremiah Lowin, Qianli Ma, Pierre-Antoine Manzagol, Olivier Mastropietro, Robert T. McGibbon, Roland Memisevic, Bart van Merriënboer, Vincent Michalski, Mehdi Mirza, Alberto Orlandi, Christopher Pal, Razvan Pascanu, Mohammad Pezeshki, Colin Raffel, Daniel Renshaw, Matthew Rocklin, Adriana Romero, Markus Roth, Peter Sadowski, John Salvatier, François Savard, Jan Schlüter, John Schulman, Gabriel Schwartz, Iulian Vlad Serban, Dmitriy Serdyuk, Samira Shabanian, Étienne Simon, Sigurd Spieckermann, S. Ramana Subramanyam, Jakub Sygnowski, Jérémie Tanguay, Gijs van Tulder, Joseph Turian, Sebastian Urban, Pascal Vincent, Francesco Visin, Harm de Vries, David Warde-Farley, Dustin J. Webb, Matthew Willson, Kelvin Xu, Lijun Xue, Li Yao, Saizheng Zhang, Ying Zhang

Collaborative prediction with expert advice

Apr 08, 2016
Paul Christiano
