Get our free extension to see links to code for papers anywhere online!

Chrome logo  Add to Chrome

Firefox logo Add to Firefox

Self-critiquing models for assisting human evaluators



William Saunders , Catherine Yeh , Jeff Wu , Steven Bills , Long Ouyang , Jonathan Ward , Jan Leike


   Access Paper or Ask Questions

Training language models to follow instructions with human feedback



Long Ouyang , Jeff Wu , Xu Jiang , Diogo Almeida , Carroll L. Wainwright , Pamela Mishkin , Chong Zhang , Sandhini Agarwal , Katarina Slama , Alex Ray , John Schulman , Jacob Hilton , Fraser Kelton , Luke Miller , Maddie Simens , Amanda Askell , Peter Welinder , Paul Christiano , Jan Leike , Ryan Lowe


   Access Paper or Ask Questions

WebGPT: Browser-assisted question-answering with human feedback



Reiichiro Nakano , Jacob Hilton , Suchir Balaji , Jeff Wu , Long Ouyang , Christina Kim , Christopher Hesse , Shantanu Jain , Vineet Kosaraju , William Saunders , Xu Jiang , Karl Cobbe , Tyna Eloundou , Gretchen Krueger , Kevin Button , Matthew Knight , Benjamin Chess , John Schulman

* 30 pages 

   Access Paper or Ask Questions

Recursively Summarizing Books with Human Feedback



Jeff Wu , Long Ouyang , Daniel M. Ziegler , Nisan Stiennon , Ryan Lowe , Jan Leike , Paul Christiano


   Access Paper or Ask Questions

Learning to summarize from human feedback



Nisan Stiennon , Long Ouyang , Jeff Wu , Daniel M. Ziegler , Ryan Lowe , Chelsea Voss , Alec Radford , Dario Amodei , Paul Christiano


   Access Paper or Ask Questions

Bayesian Inference of Regular Expressions from Human-Generated Example Strings



Long Ouyang


   Access Paper or Ask Questions

Pedagogical learning



Long Ouyang , Michael C. Frank


   Access Paper or Ask Questions

Practical optimal experiment design with probabilistic programs



Long Ouyang , Michael Henry Tessler , Daniel Ly , Noah Goodman


   Access Paper or Ask Questions