Chris Brockett

A Knowledge-Grounded Neural Conversation Model

Feb 07, 2017

Marjan Ghazvininejad, Chris Brockett, Ming-Wei Chang, Bill Dolan, Jianfeng Gao, Wen-tau Yih, Michel Galley

Abstract: Neural network models are capable of generating extremely natural-sounding conversational interactions. Nevertheless, these models have yet to demonstrate that they can incorporate content in the form of factual information or entity-grounded opinion that would enable them to serve in more task-oriented conversational applications. This paper presents a novel, fully data-driven, and knowledge-grounded neural conversation model aimed at producing more contentful responses without slot filling. We generalize the widely-used Seq2Seq approach by conditioning responses on both conversation history and external "facts", allowing the model to be versatile and applicable in an open-domain setting. Our approach yields significant improvements over a competitive Seq2Seq baseline. Human judges found that our outputs are significantly more informative.

* 10 pages

Emulating Human Conversations using Convolutional Neural Network-based IR

Jun 22, 2016

Abhay Prakash, Chris Brockett, Puneet Agrawal

Abstract: Conversational agents ("bots") are beginning to be widely used in conversational interfaces. To design a system that is capable of emulating human-like interactions, a conversational layer that can serve as a fabric for chat-like interaction with the agent is needed. In this paper, we introduce a model that employs Information Retrieval by utilizing convolutional deep structured semantic neural network-based features in the ranker to present human-like responses in ongoing conversation with a user. In conversations, accounting for context is critical to the retrieval model; we show that our context-sensitive approach using a Convolutional Deep Structured Semantic Model (cDSSM) with character trigrams significantly outperforms several conventional baselines in terms of the relevance of responses retrieved.

* 5 pages, Neu-IR'16 SIGIR Workshop on Neural Information Retrieval, July 21, 2016, Pisa, Italy

A Diversity-Promoting Objective Function for Neural Conversation Models

Jun 10, 2016

Jiwei Li, Michel Galley, Chris Brockett, Jianfeng Gao, Bill Dolan

Abstract: Sequence-to-sequence neural network models for generation of conversational responses tend to generate safe, commonplace responses (e.g., "I don't know") regardless of the input. We suggest that the traditional objective function, i.e., the likelihood of output (response) given input (message), is unsuited to response generation tasks. Instead we propose using Maximum Mutual Information (MMI) as the objective function in neural models. Experimental results demonstrate that the proposed MMI models produce more diverse, interesting, and appropriate responses, yielding substantive gains in BLEU scores on two conversational datasets and in human evaluations.

* In Proc. of NAACL 2016
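
The contrast the abstract draws between a likelihood objective and an MMI objective can be illustrated with a small reranking sketch. The abstract does not spell out the formula, so the weighted form used here, log p(T|S) - lam * log p(T), is an assumption about one common way to write an MMI-style criterion. The function name `mmi_rerank`, the weight `lam`, and all log-probability values are hypothetical and chosen only for illustration; they are not taken from the paper's experiments.

```python
from typing import List, Tuple

# Each candidate is (response text, log p(T|S), log p(T)).
# All numbers below are made up for illustration; in practice they would
# come from a conditional response model and an unconditional language model.
Candidate = Tuple[str, float, float]


def mmi_rerank(candidates: List[Candidate], lam: float = 0.5) -> List[Tuple[str, float]]:
    """Rank candidates by an assumed MMI-style score.

    score = log p(T|S) - lam * log p(T)

    With lam = 0 this reduces to plain likelihood ranking. A larger lam
    penalizes responses that are probable regardless of the input.
    """
    scored = [
        (response, lp_cond - lam * lp_prior)
        for response, lp_cond, lp_prior in candidates
    ]
    # Highest score first.
    return sorted(scored, key=lambda item: item[1], reverse=True)


CANDIDATES: List[Candidate] = [
    # A generic reply: likely given the message, but also likely a priori.
    ("I don't know.", -2.0, -1.5),
    # A specific reply: slightly less likely given the message, rare a priori.
    ("I work at a bakery downtown.", -2.6, -7.0),
]

if __name__ == "__main__":
    print("likelihood only:", mmi_rerank(CANDIDATES, lam=0.0)[0][0])
    print("MMI-style      :", mmi_rerank(CANDIDATES, lam=0.5)[0][0])
```

Under these made-up scores, likelihood-only ranking prefers the generic reply, while the penalized score prefers the specific one, which is the behavior the abstract attributes to MMI.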

A Persona-Based Neural Conversation Model

Jun 08, 2016

Jiwei Li, Michel Galley, Chris Brockett, Georgios P. Spithourakis, Jianfeng Gao, Bill Dolan

Abstract: We present persona-based models for handling the issue of speaker consistency in neural response generation. A speaker model encodes personas in distributed embeddings that capture individual characteristics such as background information and speaking style. A dyadic speaker-addressee model captures properties of interactions between two interlocutors. Our models yield qualitative performance improvements in both perplexity and BLEU scores over baseline sequence-to-sequence models, with similar gains in speaker consistency as measured by human judges.

* Accepted for publication at ACL 2016

deltaBLEU: A Discriminative Metric for Generation Tasks with Intrinsically Diverse Targets

Jun 24, 2015

Michel Galley, Chris Brockett, Alessandro Sordoni, Yangfeng Ji, Michael Auli, Chris Quirk, Margaret Mitchell, Jianfeng Gao, Bill Dolan

Abstract: We introduce Discriminative BLEU (deltaBLEU), a novel metric for intrinsic evaluation of generated text in tasks that admit a diverse range of possible outputs. Reference strings are scored for quality by human raters on a scale of [-1, +1] to weight multi-reference BLEU. In tasks involving generation of conversational responses, deltaBLEU correlates reasonably with human judgments and outperforms sentence-level and IBM BLEU in terms of both Spearman's rho and Kendall's tau.

* 6 pages, to appear at ACL 2015

A Neural Network Approach to Context-Sensitive Generation of Conversational Responses

Jun 22, 2015

Alessandro Sordoni, Michel Galley, Michael Auli, Chris Brockett, Yangfeng Ji, Margaret Mitchell, Jian-Yun Nie, Jianfeng Gao, Bill Dolan

Abstract: We present a novel response generation system that can be trained end to end on large quantities of unstructured Twitter conversations. A neural network architecture is used to address sparsity issues that arise when integrating contextual information into classic statistical models, allowing the system to take into account previous dialog utterances. Our dynamic-context generative models show consistent gains over both context-sensitive and non-context-sensitive Machine Translation and Information Retrieval baselines.

* A. Sordoni, M. Galley, M. Auli, C. Brockett, Y. Ji, M. Mitchell, J.-Y. Nie, J. Gao, B. Dolan. 2015. A Neural Network Approach to Context-Sensitive Generation of Conversational Responses. In Proc. of NAACL-HLT. Pages 196-205
