Get our free extension to see links to code for papers anywhere online!

Chrome logo  Add to Chrome

Firefox logo Add to Firefox

Self-critiquing models for assisting human evaluators



William Saunders , Catherine Yeh , Jeff Wu , Steven Bills , Long Ouyang , Jonathan Ward , Jan Leike


   Access Paper or Ask Questions

Training language models to follow instructions with human feedback



Long Ouyang , Jeff Wu , Xu Jiang , Diogo Almeida , Carroll L. Wainwright , Pamela Mishkin , Chong Zhang , Sandhini Agarwal , Katarina Slama , Alex Ray , John Schulman , Jacob Hilton , Fraser Kelton , Luke Miller , Maddie Simens , Amanda Askell , Peter Welinder , Paul Christiano , Jan Leike , Ryan Lowe


   Access Paper or Ask Questions

Safe Deep RL in 3D Environments using Human Feedback



Matthew Rahtz , Vikrant Varma , Ramana Kumar , Zachary Kenton , Shane Legg , Jan Leike


   Access Paper or Ask Questions

Recursively Summarizing Books with Human Feedback



Jeff Wu , Long Ouyang , Daniel M. Ziegler , Nisan Stiennon , Ryan Lowe , Jan Leike , Paul Christiano


   Access Paper or Ask Questions

Evaluating Large Language Models Trained on Code



Mark Chen , Jerry Tworek , Heewoo Jun , Qiming Yuan , Henrique Ponde de Oliveira Pinto , Jared Kaplan , Harri Edwards , Yuri Burda , Nicholas Joseph , Greg Brockman , Alex Ray , Raul Puri , Gretchen Krueger , Michael Petrov , Heidy Khlaaf , Girish Sastry , Pamela Mishkin , Brooke Chan , Scott Gray , Nick Ryder , Mikhail Pavlov , Alethea Power , Lukasz Kaiser , Mohammad Bavarian , Clemens Winter , Philippe Tillet , Felipe Petroski Such , Dave Cummings , Matthias Plappert , Fotios Chantzis , Elizabeth Barnes , Ariel Herbert-Voss , William Hebgen Guss , Alex Nichol , Alex Paino , Nikolas Tezak , Jie Tang , Igor Babuschkin , Suchir Balaji , Shantanu Jain , William Saunders , Christopher Hesse , Andrew N. Carr , Jan Leike , Josh Achiam , Vedant Misra , Evan Morikawa , Alec Radford , Matthew Knight , Miles Brundage , Mira Murati , Katie Mayer , Peter Welinder , Bob McGrew , Dario Amodei , Sam McCandlish , Ilya Sutskever , Wojciech Zaremba

* corrected typos, added references, added authors, added acknowledgements 

   Access Paper or Ask Questions

Institutionalising Ethics in AI through Broader Impact Requirements



Carina Prunkl , Carolyn Ashurst , Markus Anderljung , Helena Webb , Jan Leike , Allan Dafoe

* Nature Machine Intelligence 3.2 (2021): 104-110 

   Access Paper or Ask Questions

Active Reinforcement Learning: Observing Rewards at a Cost



David Krueger , Jan Leike , Owain Evans , John Salvatier

* Originally appeared at the NeurIPS 2016 "Future of Interactive Learning Machines (FILM)" workshop 

   Access Paper or Ask Questions

1
2
3
4
>>