Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Alan W Black

A Dataset for Document Grounded Conversations

Sep 19, 2018
Kangyan Zhou, Shrimai Prabhumoye, Alan W Black

Figure 1 for A Dataset for Document Grounded Conversations

Figure 2 for A Dataset for Document Grounded Conversations

Figure 3 for A Dataset for Document Grounded Conversations

This paper introduces a document grounded dataset for text conversations. We define "Document Grounded Conversations" as conversations that are about the contents of a specified document. In this dataset the specified documents were Wikipedia articles about popular movies. The dataset contains 4112 conversations with an average of 21.43 turns per conversation. This positions this dataset to not only provide a relevant chat history while generating responses but also provide a source of information that the models could use. We describe two neural architectures that provide benchmark performance on the task of generating the next response. We also evaluate our models for engagement and fluency, and find that the information from the document helps in generating more engaging and fluent responses.

* Conference on Empirical Methods in Natural Language Processing (EMNLP), 2018

Via

Access Paper or Ask Questions

Style Transfer Through Multilingual and Feedback-Based Back-Translation

Sep 17, 2018
Shrimai Prabhumoye, Yulia Tsvetkov, Alan W Black, Ruslan Salakhutdinov

Figure 1 for Style Transfer Through Multilingual and Feedback-Based Back-Translation

Figure 2 for Style Transfer Through Multilingual and Feedback-Based Back-Translation

Figure 3 for Style Transfer Through Multilingual and Feedback-Based Back-Translation

Figure 4 for Style Transfer Through Multilingual and Feedback-Based Back-Translation

Style transfer is the task of transferring an attribute of a sentence (e.g., formality) while maintaining its semantic content. The key challenge in style transfer is to strike a balance between the competing goals, one to preserve meaning and the other to improve the style transfer accuracy. Prior research has identified that the task of meaning preservation is generally harder to attain and evaluate. This paper proposes two extensions of the state-of-the-art style transfer models aiming at improving the meaning preservation in style transfer. Our evaluation shows that these extensions help to ground meaning better while improving the transfer accuracy.

Via

Access Paper or Ask Questions

Data Augmentation for Neural Online Chat Response Selection

Sep 03, 2018
Wenchao Du, Alan W Black

Figure 1 for Data Augmentation for Neural Online Chat Response Selection

Figure 2 for Data Augmentation for Neural Online Chat Response Selection

Figure 3 for Data Augmentation for Neural Online Chat Response Selection

Figure 4 for Data Augmentation for Neural Online Chat Response Selection

Data augmentation seeks to manipulate the available data for training to improve the generalization ability of models. We investigate two data augmentation proxies, permutation and flipping, for neural dialog response selection task on various models over multiple datasets, including both Chinese and English languages. Different from standard data augmentation techniques, our method combines the original and synthesized data for prediction. Empirical results show that our approach can gain 1 to 3 recall-at-1 points over baseline models in both full-scale and small-scale settings.

* EMNLP 2018 Workshop

Via

Access Paper or Ask Questions

Generating Mandarin and Cantonese F0 Contours with Decision Trees and BLSTMs

Jul 04, 2018
Weidong Yuan, Alan W Black

Figure 1 for Generating Mandarin and Cantonese F0 Contours with Decision Trees and BLSTMs

Figure 2 for Generating Mandarin and Cantonese F0 Contours with Decision Trees and BLSTMs

Figure 3 for Generating Mandarin and Cantonese F0 Contours with Decision Trees and BLSTMs

Figure 4 for Generating Mandarin and Cantonese F0 Contours with Decision Trees and BLSTMs

This paper models the fundamental frequency contours on both Mandarin and Cantonese speech with decision trees and DNNs (deep neural networks). Different kinds of f0 representations and model architectures are tested for decision trees and DNNs. A new model called Additive-BLSTM (additive bidirectional long short term memory) that predicts a base f0 contour and a residual f0 contour with two BLSTMs is proposed. With respect to objective measures of RMSE and correlation, applying tone-dependent trees together with sample normalization and delta feature regularization within decision tree framework performs best. While the new Additive-BLSTM model with delta feature regularization performs even better. Subjective listening tests on both Mandarin and Cantonese comparing Random Forest model (multiple decision trees) and the Additive-BLSTM model were also held and confirmed the advantage of the new model according to the listeners' preference.

* 5 pages

Via

Access Paper or Ask Questions

Style Transfer Through Back-Translation

May 24, 2018
Shrimai Prabhumoye, Yulia Tsvetkov, Ruslan Salakhutdinov, Alan W Black

Figure 1 for Style Transfer Through Back-Translation

Figure 2 for Style Transfer Through Back-Translation

Figure 3 for Style Transfer Through Back-Translation

Figure 4 for Style Transfer Through Back-Translation

Style transfer is the task of rephrasing the text to contain specific stylistic properties without changing the intent or affect within the context. This paper introduces a new method for automatic style transfer. We first learn a latent representation of the input sentence which is grounded in a language translation model in order to better preserve the meaning of the sentence while reducing stylistic properties. Then adversarial generation techniques are used to make the output match the desired style. We evaluate this technique on three different style transformations: sentiment, gender and political slant. Compared to two state-of-the-art style transfer modeling techniques we show improvements both in automatic evaluation of style transfer and in manual evaluation of meaning preservation and fluency.

* Accepted at ACL 2018

Via

Access Paper or Ask Questions

Linguistic Markers of Influence in Informal Interactions

Jul 14, 2017
Shrimai Prabhumoye, Samridhi Choudhary, Evangelia Spiliopoulou, Christopher Bogart, Carolyn Penstein Rose, Alan W Black

Figure 1 for Linguistic Markers of Influence in Informal Interactions

Figure 2 for Linguistic Markers of Influence in Informal Interactions

Figure 3 for Linguistic Markers of Influence in Informal Interactions

Figure 4 for Linguistic Markers of Influence in Informal Interactions

There has been a long standing interest in understanding `Social Influence' both in Social Sciences and in Computational Linguistics. In this paper, we present a novel approach to study and measure interpersonal influence in daily interactions. Motivated by the basic principles of influence, we attempt to identify indicative linguistic features of the posts in an online knitting community. We present the scheme used to operationalize and label the posts with indicator features. Experiments with the identified features show an improvement in the classification accuracy of influence by 3.15%. Our results illustrate the important correlation between the characteristics of the language and its potential to influence others.

* 10 pages, Accepted in NLP+CSS workshop for ACL (Association for Computational Linguistics) 2017

Via

Access Paper or Ask Questions

Learning Conversational Systems that Interleave Task and Non-Task Content

Mar 01, 2017
Zhou Yu, Alan W Black, Alexander I. Rudnicky

Figure 1 for Learning Conversational Systems that Interleave Task and Non-Task Content

Figure 2 for Learning Conversational Systems that Interleave Task and Non-Task Content

Figure 3 for Learning Conversational Systems that Interleave Task and Non-Task Content

Figure 4 for Learning Conversational Systems that Interleave Task and Non-Task Content

Task-oriented dialog systems have been applied in various tasks, such as automated personal assistants, customer service providers and tutors. These systems work well when users have clear and explicit intentions that are well-aligned to the systems' capabilities. However, they fail if users intentions are not explicit. To address this shortcoming, we propose a framework to interleave non-task content (i.e. everyday social conversation) into task conversations. When the task content fails, the system can still keep the user engaged with the non-task content. We trained a policy using reinforcement learning algorithms to promote long-turn conversation coherence and consistency, so that the system can have smooth transitions between task and non-task content. To test the effectiveness of the proposed framework, we developed a movie promotion dialog system. Experiments with human users indicate that a system that interleaves social and task content achieves a better task success rate and is also rated as more engaging compared to a pure task-oriented system.

* Dialog Systems, Reinforcement Learning

Via

Access Paper or Ask Questions

Polyglot Neural Language Models: A Case Study in Cross-Lingual Phonetic Representation Learning

May 12, 2016
Yulia Tsvetkov, Sunayana Sitaram, Manaal Faruqui, Guillaume Lample, Patrick Littell, David Mortensen, Alan W Black, Lori Levin, Chris Dyer

Figure 1 for Polyglot Neural Language Models: A Case Study in Cross-Lingual Phonetic Representation Learning

Figure 2 for Polyglot Neural Language Models: A Case Study in Cross-Lingual Phonetic Representation Learning

Figure 3 for Polyglot Neural Language Models: A Case Study in Cross-Lingual Phonetic Representation Learning

Figure 4 for Polyglot Neural Language Models: A Case Study in Cross-Lingual Phonetic Representation Learning

We introduce polyglot language models, recurrent neural network models trained to predict symbol sequences in many different languages using shared representations of symbols and conditioning on typological information about the language to be predicted. We apply these to the problem of modeling phone sequences---a domain in which universal symbol inventories and cross-linguistically shared feature representations are a natural fit. Intrinsic evaluation on held-out perplexity, qualitative analysis of the learned representations, and extrinsic evaluation in two downstream applications that make use of phonetic features show (i) that polyglot models better generalize to held-out data than comparable monolingual models and (ii) that polyglot phonetic feature representations are of higher quality than those learned monolingually.

* Proceedings of NAACL 2016; 10 pages

Via

Access Paper or Ask Questions

Recurrent Neural Network Postfilters for Statistical Parametric Speech Synthesis

Jan 26, 2016
Prasanna Kumar Muthukumar, Alan W Black

Figure 1 for Recurrent Neural Network Postfilters for Statistical Parametric Speech Synthesis

Figure 2 for Recurrent Neural Network Postfilters for Statistical Parametric Speech Synthesis

Figure 3 for Recurrent Neural Network Postfilters for Statistical Parametric Speech Synthesis

Figure 4 for Recurrent Neural Network Postfilters for Statistical Parametric Speech Synthesis

In the last two years, there have been numerous papers that have looked into using Deep Neural Networks to replace the acoustic model in traditional statistical parametric speech synthesis. However, far less attention has been paid to approaches like DNN-based postfiltering where DNNs work in conjunction with traditional acoustic models. In this paper, we investigate the use of Recurrent Neural Networks as a potential postfilter for synthesis. We explore the possibility of replacing existing postfilters, as well as highlight the ease with which arbitrary new features can be added as input to the postfilter. We also tried a novel approach of jointly training the Classification And Regression Tree and the postfilter, rather than the traditional approach of training them independently.

Via

Access Paper or Ask Questions

Character-based Neural Machine Translation

Nov 14, 2015
Wang Ling, Isabel Trancoso, Chris Dyer, Alan W Black

Figure 1 for Character-based Neural Machine Translation

Figure 2 for Character-based Neural Machine Translation

Figure 3 for Character-based Neural Machine Translation

Figure 4 for Character-based Neural Machine Translation

We introduce a neural machine translation model that views the input and output sentences as sequences of characters rather than words. Since word-level information provides a crucial source of bias, our input model composes representations of character sequences into representations of words (as determined by whitespace boundaries), and then these are translated using a joint attention/translation model. In the target language, the translation is modeled as a sequence of word vectors, but each word is generated one character at a time, conditional on the previous character generations in each word. As the representation and generation of words is performed at the character level, our model is capable of interpreting and generating unseen word forms. A secondary benefit of this approach is that it alleviates much of the challenges associated with preprocessing/tokenization of the source and target languages. We show that our model can achieve translation results that are on par with conventional word-based models.

Via

Access Paper or Ask Questions