Get our free extension to see links to code for papers anywhere online!

Chrome logo Add to Chrome

Firefox logo Add to Firefox

Picture for Jan Leike

Recursively Summarizing Books with Human Feedback


Sep 27, 2021
Jeff Wu, Long Ouyang, Daniel M. Ziegler, Nisan Stiennon, Ryan Lowe, Jan Leike, Paul Christiano


  Access Paper or Ask Questions

Evaluating Large Language Models Trained on Code


Jul 14, 2021
Mark Chen, Jerry Tworek, Heewoo Jun, Qiming Yuan, Henrique Ponde de Oliveira Pinto, Jared Kaplan, Harri Edwards, Yuri Burda, Nicholas Joseph, Greg Brockman, Alex Ray, Raul Puri, Gretchen Krueger, Michael Petrov, Heidy Khlaaf, Girish Sastry, Pamela Mishkin, Brooke Chan, Scott Gray, Nick Ryder, Mikhail Pavlov, Alethea Power, Lukasz Kaiser, Mohammad Bavarian, Clemens Winter, Philippe Tillet, Felipe Petroski Such, Dave Cummings, Matthias Plappert, Fotios Chantzis, Elizabeth Barnes, Ariel Herbert-Voss, William Hebgen Guss, Alex Nichol, Alex Paino, Nikolas Tezak, Jie Tang, Igor Babuschkin, Suchir Balaji, Shantanu Jain, William Saunders, Christopher Hesse, Andrew N. Carr, Jan Leike, Josh Achiam, Vedant Misra, Evan Morikawa, Alec Radford, Matthew Knight, Miles Brundage, Mira Murati, Katie Mayer, Peter Welinder, Bob McGrew, Dario Amodei, Sam McCandlish, Ilya Sutskever, Wojciech Zaremba

* corrected typos, added references, added authors, added acknowledgements 

  Access Paper or Ask Questions

Institutionalising Ethics in AI through Broader Impact Requirements


May 30, 2021
Carina Prunkl, Carolyn Ashurst, Markus Anderljung, Helena Webb, Jan Leike, Allan Dafoe

* Nature Machine Intelligence 3.2 (2021): 104-110 

  Access Paper or Ask Questions

Active Reinforcement Learning: Observing Rewards at a Cost


Nov 24, 2020
David Krueger, Jan Leike, Owain Evans, John Salvatier

* Originally appeared at the NeurIPS 2016 "Future of Interactive Learning Machines (FILM)" workshop 

  Access Paper or Ask Questions

Hidden Incentives for Auto-Induced Distributional Shift


Sep 19, 2020
David Krueger, Tegan Maharaj, Jan Leike


  Access Paper or Ask Questions

Quantifying Differences in Reward Functions


Jun 24, 2020
Adam Gleave, Michael Dennis, Shane Legg, Stuart Russell, Jan Leike

* 8 pages main paper, 29 pages total 

  Access Paper or Ask Questions

Pitfalls of learning a reward function online


Apr 28, 2020
Stuart Armstrong, Jan Leike, Laurent Orseau, Shane Legg


  Access Paper or Ask Questions

Learning Human Objectives by Evaluating Hypothetical Behavior


Dec 05, 2019
Siddharth Reddy, Anca D. Dragan, Sergey Levine, Shane Legg, Jan Leike


  Access Paper or Ask Questions

Scaling shared model governance via model splitting


Dec 14, 2018
Miljan Martic, Jan Leike, Andrew Trask, Matteo Hessel, Shane Legg, Pushmeet Kohli

* 9 pages 

  Access Paper or Ask Questions

Scalable agent alignment via reward modeling: a research direction


Nov 19, 2018
Jan Leike, David Krueger, Tom Everitt, Miljan Martic, Vishal Maini, Shane Legg


  Access Paper or Ask Questions

Reward learning from human preferences and demonstrations in Atari


Nov 15, 2018
Borja Ibarz, Jan Leike, Tobias Pohlen, Geoffrey Irving, Shane Legg, Dario Amodei

* NIPS 2018 

  Access Paper or Ask Questions

Learning to Understand Goal Specifications by Modelling Reward


Oct 02, 2018
Dzmitry Bahdanau, Felix Hill, Jan Leike, Edward Hughes, Pushmeet Kohli, Edward Grefenstette

* 18 pages, 8 figures 

  Access Paper or Ask Questions

AI Safety Gridworlds


Nov 28, 2017
Jan Leike, Miljan Martic, Victoria Krakovna, Pedro A. Ortega, Tom Everitt, Andrew Lefrancq, Laurent Orseau, Shane Legg


  Access Paper or Ask Questions

Deep reinforcement learning from human preferences


Jul 13, 2017
Paul Christiano, Jan Leike, Tom B. Brown, Miljan Martic, Shane Legg, Dario Amodei


  Access Paper or Ask Questions

Universal Reinforcement Learning Algorithms: Survey and Experiments


May 30, 2017
John Aslanides, Jan Leike, Marcus Hutter

* 8 pages, 6 figures, Twenty-sixth International Joint Conference on Artificial Intelligence (IJCAI-17) 

  Access Paper or Ask Questions

Generalised Discount Functions applied to a Monte-Carlo AImu Implementation


Mar 03, 2017
Sean Lamont, John Aslanides, Jan Leike, Marcus Hutter

* 12 pages, 4 figures 

  Access Paper or Ask Questions

Nonparametric General Reinforcement Learning


Nov 28, 2016
Jan Leike

* PhD thesis 

  Access Paper or Ask Questions

Exploration Potential


Nov 18, 2016
Jan Leike

* 10 pages, including proofs 

  Access Paper or Ask Questions

A Formal Solution to the Grain of Truth Problem


Sep 16, 2016
Jan Leike, Jessica Taylor, Benya Fallenstein

* UAI 2016 

  Access Paper or Ask Questions

Thompson Sampling is Asymptotically Optimal in General Environments


Jun 03, 2016
Jan Leike, Tor Lattimore, Laurent Orseau, Marcus Hutter

* UAI 2016 

  Access Paper or Ask Questions

Loss Bounds and Time Complexity for Speed Priors


Apr 12, 2016
Daniel Filan, Marcus Hutter, Jan Leike

* AISTATS 2016 

  Access Paper or Ask Questions

On the Computability of AIXI


Oct 19, 2015
Jan Leike, Marcus Hutter

* UAI 2015 

  Access Paper or Ask Questions

Bad Universal Priors and Notions of Optimality


Oct 16, 2015
Jan Leike, Marcus Hutter

* COLT 2015 

  Access Paper or Ask Questions

On the Computability of Solomonoff Induction and Knowledge-Seeking


Jul 15, 2015
Jan Leike, Marcus Hutter

* ALT 2015 

  Access Paper or Ask Questions

Solomonoff Induction Violates Nicod's Criterion


Jul 15, 2015
Jan Leike, Marcus Hutter

* ALT 2015 

  Access Paper or Ask Questions

Sequential Extensions of Causal and Evidential Decision Theory


Jun 24, 2015
Tom Everitt, Jan Leike, Marcus Hutter

* ADT 2015 

  Access Paper or Ask Questions

A Definition of Happiness for Reinforcement Learning Agents


May 18, 2015
Mayank Daswani, Jan Leike

* AGI 2015 

  Access Paper or Ask Questions