Joelle Pineau

A Large-Scale, Open-Domain, Mixed-Interface Dialogue-Based ITS for STEM

May 06, 2020
Via arXiv

Learning an Unreferenced Metric for Online Dialogue Evaluation

May 01, 2020
Via arXiv

Improving Reproducibility in Machine Learning Research (A Report from the NeurIPS 2019 Reproducibility Program)

Apr 02, 2020
Via arXiv

Evaluating Logical Generalization in Graph Neural Networks

Mar 14, 2020
Via arXiv

Interference and Generalization in Temporal Difference Learning

Mar 13, 2020
Via arXiv

Invariant Causal Prediction for Block MDPs

Mar 12, 2020
Via arXiv

Stable Policy Optimization via Off-Policy Divergence Regularization

Mar 09, 2020
Via arXiv

Scalable Multi-Agent Inverse Reinforcement Learning via Actor-Attention-Critic

Feb 24, 2020
Via arXiv

Provably efficient reconstruction of policy networks

Feb 07, 2020
Via arXiv

On the interaction between supervision and self-play in emergent communication

Feb 04, 2020
Via arXiv