Training language models to follow instructions with human feedback

Long Ouyang, Jeff Wu, Xu Jiang, Diogo Almeida, Carroll L. Wainwright, Pamela Mishkin, Chong Zhang, Sandhini Agarwal, Katarina Slama, Alex Ray, John Schulman, Jacob Hilton, Fraser Kelton, Luke Miller, Maddie Simens, Amanda Askell, Peter Welinder, Paul Christiano, Jan Leike, Ryan Lowe

SafeLife 1.0: Exploring Side Effects in Complex Environments

Carroll L. Wainwright, Peter Eckersley

* Accepted at the 2019 NeurIPS Safety and Robustness in Decision Making Workshop 
