Alert button
Picture for Carroll L. Wainwright

Carroll L. Wainwright

Alert button

Training language models to follow instructions with human feedback

Mar 04, 2022
Long Ouyang, Jeff Wu, Xu Jiang, Diogo Almeida, Carroll L. Wainwright, Pamela Mishkin, Chong Zhang, Sandhini Agarwal, Katarina Slama, Alex Ray, John Schulman, Jacob Hilton, Fraser Kelton, Luke Miller, Maddie Simens, Amanda Askell, Peter Welinder, Paul Christiano, Jan Leike, Ryan Lowe

Figure 1 for Training language models to follow instructions with human feedback
Figure 2 for Training language models to follow instructions with human feedback
Figure 3 for Training language models to follow instructions with human feedback
Figure 4 for Training language models to follow instructions with human feedback
Viaarxiv icon

SafeLife 1.0: Exploring Side Effects in Complex Environments

Dec 03, 2019
Carroll L. Wainwright, Peter Eckersley

Figure 1 for SafeLife 1.0: Exploring Side Effects in Complex Environments
Figure 2 for SafeLife 1.0: Exploring Side Effects in Complex Environments
Figure 3 for SafeLife 1.0: Exploring Side Effects in Complex Environments
Figure 4 for SafeLife 1.0: Exploring Side Effects in Complex Environments
Viaarxiv icon