Ryan Lowe

Training language models to follow instructions with human feedback

Mar 04, 2022
Long Ouyang, Jeff Wu, Xu Jiang, Diogo Almeida, Carroll L. Wainwright, Pamela Mishkin, Chong Zhang, Sandhini Agarwal, Katarina Slama, Alex Ray, John Schulman, Jacob Hilton, Fraser Kelton, Luke Miller, Maddie Simens, Amanda Askell, Peter Welinder, Paul Christiano, Jan Leike, Ryan Lowe

Via arXiv

Recursively Summarizing Books with Human Feedback

Sep 27, 2021
Jeff Wu, Long Ouyang, Daniel M. Ziegler, Nisan Stiennon, Ryan Lowe, Jan Leike, Paul Christiano

Via arXiv

Learning to summarize from human feedback

Sep 02, 2020
Nisan Stiennon, Long Ouyang, Jeff Wu, Daniel M. Ziegler, Ryan Lowe, Chelsea Voss, Alec Radford, Dario Amodei, Paul Christiano

Via arXiv

Ideas for Improving the Field of Machine Learning: Summarizing Discussion from the NeurIPS 2019 Retrospectives Workshop

Jul 21, 2020
Shagun Sodhani, Mayoore S. Jaiswal, Lauren Baker, Koustuv Sinha, Carl Shneider, Peter Henderson, Joel Lehman, Ryan Lowe

Via arXiv

Learning an Unreferenced Metric for Online Dialogue Evaluation

May 01, 2020
Koustuv Sinha, Prasanna Parthasarathi, Jasmine Wang, Ryan Lowe, William L. Hamilton, Joelle Pineau

Via arXiv

On the interaction between supervision and self-play in emergent communication

Feb 04, 2020
Ryan Lowe, Abhinav Gupta, Jakob Foerster, Douwe Kiela, Joelle Pineau

Via arXiv

On the Pitfalls of Measuring Emergent Communication

Mar 12, 2019
Ryan Lowe, Jakob Foerster, Y-Lan Boureau, Joelle Pineau, Yann Dauphin

Via arXiv

The Second Conversational Intelligence Challenge (ConvAI2)

Jan 31, 2019
Emily Dinan, Varvara Logacheva, Valentin Malykh, Alexander Miller, Kurt Shuster, Jack Urbanek, Douwe Kiela, Arthur Szlam, Iulian Serban, Ryan Lowe, Shrimai Prabhumoye, Alan W Black, Alexander Rudnicky, Jason Williams, Joelle Pineau, Mikhail Burtsev, Jason Weston

Via arXiv

Multi-Agent Actor-Critic for Mixed Cooperative-Competitive Environments

Jan 16, 2018
Ryan Lowe, Yi Wu, Aviv Tamar, Jean Harb, Pieter Abbeel, Igor Mordatch

Via arXiv