Get our free extension to see links to code for papers anywhere online!

Chrome logo  Add to Chrome

Firefox logo Add to Firefox

Adversarial Training for High-Stakes Reliability



Daniel M. Ziegler , Seraphina Nix , Lawrence Chan , Tim Bauman , Peter Schmidt-Nielsen , Tao Lin , Adam Scherlis , Noa Nabeshima , Ben Weinstein-Raun , Daniel de Haas , Buck Shlegeris , Nate Thomas

* 31 pages, 6 figures, small tweak 

   Access Paper or Ask Questions

Recursively Summarizing Books with Human Feedback



Jeff Wu , Long Ouyang , Daniel M. Ziegler , Nisan Stiennon , Ryan Lowe , Jan Leike , Paul Christiano


   Access Paper or Ask Questions

Scaling Laws for Autoregressive Generative Modeling



Tom Henighan , Jared Kaplan , Mor Katz , Mark Chen , Christopher Hesse , Jacob Jackson , Heewoo Jun , Tom B. Brown , Prafulla Dhariwal , Scott Gray , Chris Hallacy , Benjamin Mann , Alec Radford , Aditya Ramesh , Nick Ryder , Daniel M. Ziegler , John Schulman , Dario Amodei , Sam McCandlish

* 20+17 pages, 33 figures; added appendix with additional language results 

   Access Paper or Ask Questions

Learning to summarize from human feedback



Nisan Stiennon , Long Ouyang , Jeff Wu , Daniel M. Ziegler , Ryan Lowe , Chelsea Voss , Alec Radford , Dario Amodei , Paul Christiano


   Access Paper or Ask Questions

Language Models are Few-Shot Learners



Tom B. Brown , Benjamin Mann , Nick Ryder , Melanie Subbiah , Jared Kaplan , Prafulla Dhariwal , Arvind Neelakantan , Pranav Shyam , Girish Sastry , Amanda Askell , Sandhini Agarwal , Ariel Herbert-Voss , Gretchen Krueger , Tom Henighan , Rewon Child , Aditya Ramesh , Daniel M. Ziegler , Jeffrey Wu , Clemens Winter , Christopher Hesse , Mark Chen , Eric Sigler , Mateusz Litwin , Scott Gray , Benjamin Chess , Jack Clark , Christopher Berner , Sam McCandlish , Alec Radford , Ilya Sutskever , Dario Amodei

* 40+32 pages 

   Access Paper or Ask Questions

Fine-Tuning Language Models from Human Preferences



Daniel M. Ziegler , Nisan Stiennon , Jeffrey Wu , Tom B. Brown , Alec Radford , Dario Amodei , Paul Christiano , Geoffrey Irving


   Access Paper or Ask Questions