Alert button
Picture for Jeff Wu

Jeff Wu

Alert button

Weak-to-Strong Generalization: Eliciting Strong Capabilities With Weak Supervision

Add code
Bookmark button
Alert button
Dec 14, 2023
Collin Burns, Pavel Izmailov, Jan Hendrik Kirchner, Bowen Baker, Leo Gao, Leopold Aschenbrenner, Yining Chen, Adrien Ecoffet, Manas Joglekar, Jan Leike, Ilya Sutskever, Jeff Wu

Viaarxiv icon

Inference of Nonlinear Partial Differential Equations via Constrained Gaussian Processes

Add code
Bookmark button
Alert button
Dec 22, 2022
Zhaohui Li, Shihao Yang, Jeff Wu

Figure 1 for Inference of Nonlinear Partial Differential Equations via Constrained Gaussian Processes
Figure 2 for Inference of Nonlinear Partial Differential Equations via Constrained Gaussian Processes
Figure 3 for Inference of Nonlinear Partial Differential Equations via Constrained Gaussian Processes
Figure 4 for Inference of Nonlinear Partial Differential Equations via Constrained Gaussian Processes
Viaarxiv icon

Self-critiquing models for assisting human evaluators

Add code
Bookmark button
Alert button
Jun 14, 2022
William Saunders, Catherine Yeh, Jeff Wu, Steven Bills, Long Ouyang, Jonathan Ward, Jan Leike

Figure 1 for Self-critiquing models for assisting human evaluators
Figure 2 for Self-critiquing models for assisting human evaluators
Figure 3 for Self-critiquing models for assisting human evaluators
Figure 4 for Self-critiquing models for assisting human evaluators
Viaarxiv icon

Training language models to follow instructions with human feedback

Add code
Bookmark button
Alert button
Mar 04, 2022
Long Ouyang, Jeff Wu, Xu Jiang, Diogo Almeida, Carroll L. Wainwright, Pamela Mishkin, Chong Zhang, Sandhini Agarwal, Katarina Slama, Alex Ray, John Schulman, Jacob Hilton, Fraser Kelton, Luke Miller, Maddie Simens, Amanda Askell, Peter Welinder, Paul Christiano, Jan Leike, Ryan Lowe

Figure 1 for Training language models to follow instructions with human feedback
Figure 2 for Training language models to follow instructions with human feedback
Figure 3 for Training language models to follow instructions with human feedback
Figure 4 for Training language models to follow instructions with human feedback
Viaarxiv icon

WebGPT: Browser-assisted question-answering with human feedback

Add code
Bookmark button
Alert button
Dec 17, 2021
Reiichiro Nakano, Jacob Hilton, Suchir Balaji, Jeff Wu, Long Ouyang, Christina Kim, Christopher Hesse, Shantanu Jain, Vineet Kosaraju, William Saunders, Xu Jiang, Karl Cobbe, Tyna Eloundou, Gretchen Krueger, Kevin Button, Matthew Knight, Benjamin Chess, John Schulman

Figure 1 for WebGPT: Browser-assisted question-answering with human feedback
Figure 2 for WebGPT: Browser-assisted question-answering with human feedback
Figure 3 for WebGPT: Browser-assisted question-answering with human feedback
Figure 4 for WebGPT: Browser-assisted question-answering with human feedback
Viaarxiv icon

Recursively Summarizing Books with Human Feedback

Add code
Bookmark button
Alert button
Sep 27, 2021
Jeff Wu, Long Ouyang, Daniel M. Ziegler, Nisan Stiennon, Ryan Lowe, Jan Leike, Paul Christiano

Figure 1 for Recursively Summarizing Books with Human Feedback
Figure 2 for Recursively Summarizing Books with Human Feedback
Figure 3 for Recursively Summarizing Books with Human Feedback
Figure 4 for Recursively Summarizing Books with Human Feedback
Viaarxiv icon

Learning to summarize from human feedback

Add code
Bookmark button
Alert button
Sep 02, 2020
Nisan Stiennon, Long Ouyang, Jeff Wu, Daniel M. Ziegler, Ryan Lowe, Chelsea Voss, Alec Radford, Dario Amodei, Paul Christiano

Figure 1 for Learning to summarize from human feedback
Figure 2 for Learning to summarize from human feedback
Figure 3 for Learning to summarize from human feedback
Figure 4 for Learning to summarize from human feedback
Viaarxiv icon

Release Strategies and the Social Impacts of Language Models

Add code
Bookmark button
Alert button
Aug 24, 2019
Irene Solaiman, Miles Brundage, Jack Clark, Amanda Askell, Ariel Herbert-Voss, Jeff Wu, Alec Radford, Jasmine Wang

Viaarxiv icon