Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Ryan Lowe

Ethical Challenges in Data-Driven Dialogue Systems

Nov 24, 2017

Peter Henderson, Koustuv Sinha, Nicolas Angelard-Gontier, Nan Rosemary Ke, Genevieve Fried, Ryan Lowe, Joelle Pineau

Figure 1 for Ethical Challenges in Data-Driven Dialogue Systems

Figure 2 for Ethical Challenges in Data-Driven Dialogue Systems

Figure 3 for Ethical Challenges in Data-Driven Dialogue Systems

Figure 4 for Ethical Challenges in Data-Driven Dialogue Systems

Abstract:The use of dialogue systems as a medium for human-machine interaction is an increasingly prevalent paradigm. A growing number of dialogue systems use conversation strategies that are learned from large datasets. There are well documented instances where interactions with these system have resulted in biased or even offensive conversations due to the data-driven training process. Here, we highlight potential ethical issues that arise in dialogue systems research, including: implicit biases in data-driven systems, the rise of adversarial examples, potential sources of privacy violations, safety concerns, special considerations for reinforcement learning systems, and reproducibility concerns. We also suggest areas stemming from these issues that deserve further investigation. Through this initial survey, we hope to spur research leading to robust, safe, and ethically sound dialogue systems.

* In Submission to the AAAI/ACM conference on Artificial Intelligence, Ethics, and Society

Via

Access Paper or Ask Questions

A Survey of Available Corpora for Building Data-Driven Dialogue Systems

Mar 21, 2017

Iulian Vlad Serban, Ryan Lowe, Peter Henderson, Laurent Charlin, Joelle Pineau

Figure 1 for A Survey of Available Corpora for Building Data-Driven Dialogue Systems

Abstract:During the past decade, several areas of speech and language understanding have witnessed substantial breakthroughs from the use of data-driven models. In the area of dialogue systems, the trend is less obvious, and most practical systems are still built through significant engineering and expert knowledge. Nevertheless, several recent results suggest that data-driven approaches are feasible and quite promising. To facilitate research in this area, we have carried out a wide survey of publicly available datasets suitable for data-driven learning of dialogue systems. We discuss important characteristics of these datasets, how they can be used to learn diverse dialogue strategies, and their other potential uses. We also examine methods for transfer learning between datasets and the use of external knowledge. Finally, we discuss appropriate choice of evaluation metrics for the learning objective.

* 56 pages including references and appendix, 5 tables and 1 figure; Under review for the Dialogue & Discourse journal. Update: paper has been rewritten and now includes several new datasets

Via

Access Paper or Ask Questions

An Actor-Critic Algorithm for Sequence Prediction

Mar 03, 2017

Dzmitry Bahdanau, Philemon Brakel, Kelvin Xu, Anirudh Goyal, Ryan Lowe, Joelle Pineau, Aaron Courville, Yoshua Bengio

Figure 1 for An Actor-Critic Algorithm for Sequence Prediction

Figure 2 for An Actor-Critic Algorithm for Sequence Prediction

Abstract:We present an approach to training neural networks to generate sequences using actor-critic methods from reinforcement learning (RL). Current log-likelihood training methods are limited by the discrepancy between their training and testing modes, as models must generate tokens conditioned on their previous guesses rather than the ground-truth tokens. We address this problem by introducing a \textit{critic} network that is trained to predict the value of an output token, given the policy of an \textit{actor} network. This results in a training procedure that is much closer to the test phase, and allows us to directly optimize for a task-specific score such as BLEU. Crucially, since we leverage these techniques in the supervised learning setting rather than the traditional RL setting, we condition the critic network on the ground-truth output. We show that our method leads to improved performance on both a synthetic task, and for German-English machine translation. Our analysis paves the way for such methods to be applied in natural language generation tasks, such as machine translation, caption generation, and dialogue modelling.

Via

Access Paper or Ask Questions

How NOT To Evaluate Your Dialogue System: An Empirical Study of Unsupervised Evaluation Metrics for Dialogue Response Generation

Jan 03, 2017

Chia-Wei Liu, Ryan Lowe, Iulian V. Serban, Michael Noseworthy, Laurent Charlin, Joelle Pineau

Figure 1 for How NOT To Evaluate Your Dialogue System: An Empirical Study of Unsupervised Evaluation Metrics for Dialogue Response Generation

Figure 2 for How NOT To Evaluate Your Dialogue System: An Empirical Study of Unsupervised Evaluation Metrics for Dialogue Response Generation

Figure 3 for How NOT To Evaluate Your Dialogue System: An Empirical Study of Unsupervised Evaluation Metrics for Dialogue Response Generation

Figure 4 for How NOT To Evaluate Your Dialogue System: An Empirical Study of Unsupervised Evaluation Metrics for Dialogue Response Generation

Abstract:We investigate evaluation metrics for dialogue response generation systems where supervised labels, such as task completion, are not available. Recent works in response generation have adopted metrics from machine translation to compare a model's generated response to a single target response. We show that these metrics correlate very weakly with human judgements in the non-technical Twitter domain, and not at all in the technical Ubuntu domain. We provide quantitative and qualitative results highlighting specific weaknesses in existing metrics, and provide recommendations for future development of better automatic evaluation metrics for dialogue systems.

* First 4 authors had equal contribution. 13 pages, 5 tables, 6 figures. EMNLP 2016

Via

Access Paper or Ask Questions

Generative Deep Neural Networks for Dialogue: A Short Review

Nov 18, 2016

Iulian Vlad Serban, Ryan Lowe, Laurent Charlin, Joelle Pineau

Figure 1 for Generative Deep Neural Networks for Dialogue: A Short Review

Figure 2 for Generative Deep Neural Networks for Dialogue: A Short Review

Figure 3 for Generative Deep Neural Networks for Dialogue: A Short Review

Figure 4 for Generative Deep Neural Networks for Dialogue: A Short Review

Abstract:Researchers have recently started investigating deep neural networks for dialogue applications. In particular, generative sequence-to-sequence (Seq2Seq) models have shown promising results for unstructured tasks, such as word-level dialogue response generation. The hope is that such models will be able to leverage massive amounts of data to learn meaningful natural language representations and response generation strategies, while requiring a minimum amount of domain knowledge and hand-crafting. An important challenge is to develop models that can effectively incorporate dialogue context and generate meaningful and diverse responses. In support of this goal, we review recently proposed models based on generative encoder-decoder neural network architectures, and show that these models have better ability to incorporate long-term dialogue history, to model uncertainty and ambiguity in dialogue, and to generate responses with high-level compositional structure.

* 6 pages, 1 figure, 3 tables; NIPS 2016 workshop on Learning Methods for Dialogue

Via

Access Paper or Ask Questions

On the Evaluation of Dialogue Systems with Next Utterance Classification

Jul 23, 2016

Ryan Lowe, Iulian V. Serban, Mike Noseworthy, Laurent Charlin, Joelle Pineau

Figure 1 for On the Evaluation of Dialogue Systems with Next Utterance Classification

Figure 2 for On the Evaluation of Dialogue Systems with Next Utterance Classification

Figure 3 for On the Evaluation of Dialogue Systems with Next Utterance Classification

Abstract:An open challenge in constructing dialogue systems is developing methods for automatically learning dialogue strategies from large amounts of unlabelled data. Recent work has proposed Next-Utterance-Classification (NUC) as a surrogate task for building dialogue systems from text data. In this paper we investigate the performance of humans on this task to validate the relevance of NUC as a method of evaluation. Our results show three main findings: (1) humans are able to correctly classify responses at a rate much better than chance, thus confirming that the task is feasible, (2) human performance levels vary across task domains (we consider 3 datasets) and expertise levels (novice vs experts), thus showing that a range of performance is possible on this type of task, (3) automated dialogue systems built using state-of-the-art machine learning methods have similar performance to the human novices, but worse than the experts, thus confirming the utility of this class of tasks for driving further research in automated dialogue systems.

* Accepted to SIGDIAL 2016 (short paper). 5 pages

Via

Access Paper or Ask Questions

A Hierarchical Latent Variable Encoder-Decoder Model for Generating Dialogues

Jun 14, 2016

Iulian Vlad Serban, Alessandro Sordoni, Ryan Lowe, Laurent Charlin, Joelle Pineau, Aaron Courville, Yoshua Bengio

Figure 1 for A Hierarchical Latent Variable Encoder-Decoder Model for Generating Dialogues

Figure 2 for A Hierarchical Latent Variable Encoder-Decoder Model for Generating Dialogues

Figure 3 for A Hierarchical Latent Variable Encoder-Decoder Model for Generating Dialogues

Figure 4 for A Hierarchical Latent Variable Encoder-Decoder Model for Generating Dialogues

Abstract:Sequential data often possesses a hierarchical structure with complex dependencies between subsequences, such as found between the utterances in a dialogue. In an effort to model this kind of generative process, we propose a neural network-based generative architecture, with latent stochastic variables that span a variable number of time steps. We apply the proposed model to the task of dialogue response generation and compare it with recent neural network architectures. We evaluate the model performance through automatic evaluation metrics and by carrying out a human evaluation. The experiments demonstrate that our model improves upon recently proposed models and that the latent variables facilitate the generation of long outputs and maintain the context.

* 15 pages, 5 tables, 4 figures

Via

Access Paper or Ask Questions

Leveraging Lexical Resources for Learning Entity Embeddings in Multi-Relational Data

May 18, 2016

Teng Long, Ryan Lowe, Jackie Chi Kit Cheung, Doina Precup

Figure 1 for Leveraging Lexical Resources for Learning Entity Embeddings in Multi-Relational Data

Figure 2 for Leveraging Lexical Resources for Learning Entity Embeddings in Multi-Relational Data

Figure 3 for Leveraging Lexical Resources for Learning Entity Embeddings in Multi-Relational Data

Figure 4 for Leveraging Lexical Resources for Learning Entity Embeddings in Multi-Relational Data

Abstract:Recent work in learning vector-space embeddings for multi-relational data has focused on combining relational information derived from knowledge bases with distributional information derived from large text corpora. We propose a simple approach that leverages the descriptions of entities or phrases available in lexical resources, in conjunction with distributional semantics, in order to derive a better initialization for training relational models. Applying this initialization to the TransE model results in significant new state-of-the-art performances on the WordNet dataset, decreasing the mean rank from the previous best of 212 to 51. It also results in faster convergence of the entity representations. We find that there is a trade-off between improving the mean rank and the hits@10 with this approach. This illustrates that much remains to be understood regarding performance improvements in relational models.

* 6 pages. Accepted to ACL 2016 (short paper)

Via

Access Paper or Ask Questions

The Ubuntu Dialogue Corpus: A Large Dataset for Research in Unstructured Multi-Turn Dialogue Systems

Feb 04, 2016

Ryan Lowe, Nissan Pow, Iulian Serban, Joelle Pineau

Figure 1 for The Ubuntu Dialogue Corpus: A Large Dataset for Research in Unstructured Multi-Turn Dialogue Systems

Figure 2 for The Ubuntu Dialogue Corpus: A Large Dataset for Research in Unstructured Multi-Turn Dialogue Systems

Figure 3 for The Ubuntu Dialogue Corpus: A Large Dataset for Research in Unstructured Multi-Turn Dialogue Systems

Figure 4 for The Ubuntu Dialogue Corpus: A Large Dataset for Research in Unstructured Multi-Turn Dialogue Systems

Abstract:This paper introduces the Ubuntu Dialogue Corpus, a dataset containing almost 1 million multi-turn dialogues, with a total of over 7 million utterances and 100 million words. This provides a unique resource for research into building dialogue managers based on neural language models that can make use of large amounts of unlabeled data. The dataset has both the multi-turn property of conversations in the Dialog State Tracking Challenge datasets, and the unstructured nature of interactions from microblog services such as Twitter. We also describe two neural learning architectures suitable for analyzing this dataset, and provide benchmark performance on the task of selecting the best next response.

* SIGDIAL 2015. 10 pages, 5 figures. Update includes link to new version of the dataset, with some added features and bug fixes. See: https://github.com/rkadlec/ubuntu-ranking-dataset-creator

Via

Access Paper or Ask Questions