
Training language models to follow instructions with human feedback



Long Ouyang, Jeff Wu, Xu Jiang, Diogo Almeida, Carroll L. Wainwright, Pamela Mishkin, Chong Zhang, Sandhini Agarwal, Katarina Slama, Alex Ray, John Schulman, Jacob Hilton, Fraser Kelton, Luke Miller, Maddie Simens, Amanda Askell, Peter Welinder, Paul Christiano, Jan Leike, Ryan Lowe



Recursively Summarizing Books with Human Feedback



Jeff Wu, Long Ouyang, Daniel M. Ziegler, Nisan Stiennon, Ryan Lowe, Jan Leike, Paul Christiano



Learning to summarize from human feedback



Nisan Stiennon, Long Ouyang, Jeff Wu, Daniel M. Ziegler, Ryan Lowe, Chelsea Voss, Alec Radford, Dario Amodei, Paul Christiano



Ideas for Improving the Field of Machine Learning: Summarizing Discussion from the NeurIPS 2019 Retrospectives Workshop



Shagun Sodhani, Mayoore S. Jaiswal, Lauren Baker, Koustuv Sinha, Carl Shneider, Peter Henderson, Joel Lehman, Ryan Lowe



Learning an Unreferenced Metric for Online Dialogue Evaluation



Koustuv Sinha, Prasanna Parthasarathi, Jasmine Wang, Ryan Lowe, William L. Hamilton, Joelle Pineau

* Accepted at ACL 2020, 5 pages 


On the interaction between supervision and self-play in emergent communication



Ryan Lowe, Abhinav Gupta, Jakob Foerster, Douwe Kiela, Joelle Pineau

* The first two authors contributed equally. Accepted at ICLR 2020 


On the Pitfalls of Measuring Emergent Communication



Ryan Lowe, Jakob Foerster, Y-Lan Boureau, Joelle Pineau, Yann Dauphin

* AAMAS 2019, 13 pages


The Second Conversational Intelligence Challenge (ConvAI2)



Emily Dinan, Varvara Logacheva, Valentin Malykh, Alexander Miller, Kurt Shuster, Jack Urbanek, Douwe Kiela, Arthur Szlam, Iulian Serban, Ryan Lowe, Shrimai Prabhumoye, Alan W Black, Alexander Rudnicky, Jason Williams, Joelle Pineau, Mikhail Burtsev, Jason Weston



Multi-Agent Actor-Critic for Mixed Cooperative-Competitive Environments



Ryan Lowe, Yi Wu, Aviv Tamar, Jean Harb, Pieter Abbeel, Igor Mordatch


