Get our free extension to see links to code for papers anywhere online!

Chrome logo Add to Chrome

Firefox logo Add to Firefox

Picture for Noah Jones

Human-centric Dialog Training via Offline Reinforcement Learning


Oct 12, 2020
Natasha Jaques, Judy Hanwen Shen, Asma Ghandeharioun, Craig Ferguson, Agata Lapedriza, Noah Jones, Shixiang Shane Gu, Rosalind Picard

* To appear in EMNLP 2020 (long paper) 

  Access Paper or Ask Questions

Way Off-Policy Batch Deep Reinforcement Learning of Implicit Human Preferences in Dialog


Jul 08, 2019
Natasha Jaques, Asma Ghandeharioun, Judy Hanwen Shen, Craig Ferguson, Agata Lapedriza, Noah Jones, Shixiang Gu, Rosalind Picard


  Access Paper or Ask Questions

Approximating Interactive Human Evaluation with Self-Play for Open-Domain Dialog Systems


Jun 21, 2019
Asma Ghandeharioun, Judy Hanwen Shen, Natasha Jaques, Craig Ferguson, Noah Jones, Agata Lapedriza, Rosalind Picard


  Access Paper or Ask Questions