Alert button
Picture for Natasha Jaques

Natasha Jaques

Alert button

PsiPhi-Learning: Reinforcement Learning with Demonstrations using Successor Features and Inverse Temporal Difference Learning

Add code
Bookmark button
Alert button
Feb 24, 2021
Angelos Filos, Clare Lyle, Yarin Gal, Sergey Levine, Natasha Jaques, Gregory Farquhar

Figure 1 for PsiPhi-Learning: Reinforcement Learning with Demonstrations using Successor Features and Inverse Temporal Difference Learning
Figure 2 for PsiPhi-Learning: Reinforcement Learning with Demonstrations using Successor Features and Inverse Temporal Difference Learning
Figure 3 for PsiPhi-Learning: Reinforcement Learning with Demonstrations using Successor Features and Inverse Temporal Difference Learning
Figure 4 for PsiPhi-Learning: Reinforcement Learning with Demonstrations using Successor Features and Inverse Temporal Difference Learning
Viaarxiv icon

Emergent Complexity and Zero-shot Transfer via Unsupervised Environment Design

Add code
Bookmark button
Alert button
Dec 03, 2020
Michael Dennis, Natasha Jaques, Eugene Vinitsky, Alexandre Bayen, Stuart Russell, Andrew Critch, Sergey Levine

Figure 1 for Emergent Complexity and Zero-shot Transfer via Unsupervised Environment Design
Figure 2 for Emergent Complexity and Zero-shot Transfer via Unsupervised Environment Design
Figure 3 for Emergent Complexity and Zero-shot Transfer via Unsupervised Environment Design
Figure 4 for Emergent Complexity and Zero-shot Transfer via Unsupervised Environment Design
Viaarxiv icon

Human-centric Dialog Training via Offline Reinforcement Learning

Add code
Bookmark button
Alert button
Oct 12, 2020
Natasha Jaques, Judy Hanwen Shen, Asma Ghandeharioun, Craig Ferguson, Agata Lapedriza, Noah Jones, Shixiang Shane Gu, Rosalind Picard

Figure 1 for Human-centric Dialog Training via Offline Reinforcement Learning
Figure 2 for Human-centric Dialog Training via Offline Reinforcement Learning
Figure 3 for Human-centric Dialog Training via Offline Reinforcement Learning
Figure 4 for Human-centric Dialog Training via Offline Reinforcement Learning
Viaarxiv icon

Multi-agent Social Reinforcement Learning Improves Generalization

Add code
Bookmark button
Alert button
Oct 01, 2020
Kamal Ndousse, Douglas Eck, Sergey Levine, Natasha Jaques

Figure 1 for Multi-agent Social Reinforcement Learning Improves Generalization
Figure 2 for Multi-agent Social Reinforcement Learning Improves Generalization
Figure 3 for Multi-agent Social Reinforcement Learning Improves Generalization
Figure 4 for Multi-agent Social Reinforcement Learning Improves Generalization
Viaarxiv icon

Hierarchical Reinforcement Learning for Open-Domain Dialog

Add code
Bookmark button
Alert button
Sep 18, 2019
Abdelrhman Saleh, Natasha Jaques, Asma Ghandeharioun, Judy Hanwen Shen, Rosalind Picard

Figure 1 for Hierarchical Reinforcement Learning for Open-Domain Dialog
Figure 2 for Hierarchical Reinforcement Learning for Open-Domain Dialog
Figure 3 for Hierarchical Reinforcement Learning for Open-Domain Dialog
Figure 4 for Hierarchical Reinforcement Learning for Open-Domain Dialog
Viaarxiv icon

Way Off-Policy Batch Deep Reinforcement Learning of Implicit Human Preferences in Dialog

Add code
Bookmark button
Alert button
Jul 08, 2019
Natasha Jaques, Asma Ghandeharioun, Judy Hanwen Shen, Craig Ferguson, Agata Lapedriza, Noah Jones, Shixiang Gu, Rosalind Picard

Figure 1 for Way Off-Policy Batch Deep Reinforcement Learning of Implicit Human Preferences in Dialog
Figure 2 for Way Off-Policy Batch Deep Reinforcement Learning of Implicit Human Preferences in Dialog
Figure 3 for Way Off-Policy Batch Deep Reinforcement Learning of Implicit Human Preferences in Dialog
Figure 4 for Way Off-Policy Batch Deep Reinforcement Learning of Implicit Human Preferences in Dialog
Viaarxiv icon

Approximating Interactive Human Evaluation with Self-Play for Open-Domain Dialog Systems

Add code
Bookmark button
Alert button
Jun 21, 2019
Asma Ghandeharioun, Judy Hanwen Shen, Natasha Jaques, Craig Ferguson, Noah Jones, Agata Lapedriza, Rosalind Picard

Figure 1 for Approximating Interactive Human Evaluation with Self-Play for Open-Domain Dialog Systems
Figure 2 for Approximating Interactive Human Evaluation with Self-Play for Open-Domain Dialog Systems
Figure 3 for Approximating Interactive Human Evaluation with Self-Play for Open-Domain Dialog Systems
Figure 4 for Approximating Interactive Human Evaluation with Self-Play for Open-Domain Dialog Systems
Viaarxiv icon

Tackling Climate Change with Machine Learning

Add code
Bookmark button
Alert button
Jun 10, 2019
David Rolnick, Priya L. Donti, Lynn H. Kaack, Kelly Kochanski, Alexandre Lacoste, Kris Sankaran, Andrew Slavin Ross, Nikola Milojevic-Dupont, Natasha Jaques, Anna Waldman-Brown, Alexandra Luccioni, Tegan Maharaj, Evan D. Sherwin, S. Karthik Mukkavilli, Konrad P. Kording, Carla Gomes, Andrew Y. Ng, Demis Hassabis, John C. Platt, Felix Creutzig, Jennifer Chayes, Yoshua Bengio

Figure 1 for Tackling Climate Change with Machine Learning
Viaarxiv icon