Alert button
Picture for Long Ouyang

Long Ouyang

Alert button

Self-critiquing models for assisting human evaluators

Add code
Bookmark button
Alert button
Jun 14, 2022
William Saunders, Catherine Yeh, Jeff Wu, Steven Bills, Long Ouyang, Jonathan Ward, Jan Leike

Figure 1 for Self-critiquing models for assisting human evaluators
Figure 2 for Self-critiquing models for assisting human evaluators
Figure 3 for Self-critiquing models for assisting human evaluators
Figure 4 for Self-critiquing models for assisting human evaluators
Viaarxiv icon

Training language models to follow instructions with human feedback

Add code
Bookmark button
Alert button
Mar 04, 2022
Long Ouyang, Jeff Wu, Xu Jiang, Diogo Almeida, Carroll L. Wainwright, Pamela Mishkin, Chong Zhang, Sandhini Agarwal, Katarina Slama, Alex Ray, John Schulman, Jacob Hilton, Fraser Kelton, Luke Miller, Maddie Simens, Amanda Askell, Peter Welinder, Paul Christiano, Jan Leike, Ryan Lowe

Figure 1 for Training language models to follow instructions with human feedback
Figure 2 for Training language models to follow instructions with human feedback
Figure 3 for Training language models to follow instructions with human feedback
Figure 4 for Training language models to follow instructions with human feedback
Viaarxiv icon

WebGPT: Browser-assisted question-answering with human feedback

Add code
Bookmark button
Alert button
Dec 17, 2021
Reiichiro Nakano, Jacob Hilton, Suchir Balaji, Jeff Wu, Long Ouyang, Christina Kim, Christopher Hesse, Shantanu Jain, Vineet Kosaraju, William Saunders, Xu Jiang, Karl Cobbe, Tyna Eloundou, Gretchen Krueger, Kevin Button, Matthew Knight, Benjamin Chess, John Schulman

Figure 1 for WebGPT: Browser-assisted question-answering with human feedback
Figure 2 for WebGPT: Browser-assisted question-answering with human feedback
Figure 3 for WebGPT: Browser-assisted question-answering with human feedback
Figure 4 for WebGPT: Browser-assisted question-answering with human feedback
Viaarxiv icon

Recursively Summarizing Books with Human Feedback

Add code
Bookmark button
Alert button
Sep 27, 2021
Jeff Wu, Long Ouyang, Daniel M. Ziegler, Nisan Stiennon, Ryan Lowe, Jan Leike, Paul Christiano

Figure 1 for Recursively Summarizing Books with Human Feedback
Figure 2 for Recursively Summarizing Books with Human Feedback
Figure 3 for Recursively Summarizing Books with Human Feedback
Figure 4 for Recursively Summarizing Books with Human Feedback
Viaarxiv icon

Learning to summarize from human feedback

Add code
Bookmark button
Alert button
Sep 02, 2020
Nisan Stiennon, Long Ouyang, Jeff Wu, Daniel M. Ziegler, Ryan Lowe, Chelsea Voss, Alec Radford, Dario Amodei, Paul Christiano

Figure 1 for Learning to summarize from human feedback
Figure 2 for Learning to summarize from human feedback
Figure 3 for Learning to summarize from human feedback
Figure 4 for Learning to summarize from human feedback
Viaarxiv icon

Bayesian Inference of Regular Expressions from Human-Generated Example Strings

Add code
Bookmark button
Alert button
Sep 26, 2018
Long Ouyang

Figure 1 for Bayesian Inference of Regular Expressions from Human-Generated Example Strings
Figure 2 for Bayesian Inference of Regular Expressions from Human-Generated Example Strings
Viaarxiv icon

Pedagogical learning

Add code
Bookmark button
Alert button
Nov 30, 2017
Long Ouyang, Michael C. Frank

Figure 1 for Pedagogical learning
Figure 2 for Pedagogical learning
Figure 3 for Pedagogical learning
Figure 4 for Pedagogical learning
Viaarxiv icon

Practical optimal experiment design with probabilistic programs

Add code
Bookmark button
Alert button
Aug 17, 2016
Long Ouyang, Michael Henry Tessler, Daniel Ly, Noah Goodman

Figure 1 for Practical optimal experiment design with probabilistic programs
Figure 2 for Practical optimal experiment design with probabilistic programs
Figure 3 for Practical optimal experiment design with probabilistic programs
Figure 4 for Practical optimal experiment design with probabilistic programs
Viaarxiv icon