Noah Jones


Human-centric Dialog Training via Offline Reinforcement Learning

Oct 12, 2020
Natasha Jaques, Judy Hanwen Shen, Asma Ghandeharioun, Craig Ferguson, Agata Lapedriza, Noah Jones, Shixiang Shane Gu, Rosalind Picard

Via arXiv

Way Off-Policy Batch Deep Reinforcement Learning of Implicit Human Preferences in Dialog

Jul 08, 2019
Natasha Jaques, Asma Ghandeharioun, Judy Hanwen Shen, Craig Ferguson, Agata Lapedriza, Noah Jones, Shixiang Gu, Rosalind Picard

Via arXiv

Approximating Interactive Human Evaluation with Self-Play for Open-Domain Dialog Systems

Jun 21, 2019
Asma Ghandeharioun, Judy Hanwen Shen, Natasha Jaques, Craig Ferguson, Noah Jones, Agata Lapedriza, Rosalind Picard

Via arXiv