Hui Su

Select and Attend: Towards Controllable Content Selection in Text Generation

Sep 10, 2019
Xiaoyu Shen, Jun Suzuki, Kentaro Inui, Hui Su, Dietrich Klakow, Satoshi Sekine

Many text generation tasks naturally contain two steps: content selection and surface realization. Current neural encoder-decoder models conflate both steps into a black-box architecture. As a result, the content to be described in the text cannot be explicitly controlled. This paper tackles this problem by decoupling content selection from the decoder. The decoupled content selection is human-interpretable, and its value can be manually manipulated to control the content of the generated text. The model can be trained end-to-end without human annotations by maximizing a lower bound of the marginal likelihood. We further propose an effective way to trade off performance against controllability with a single adjustable hyperparameter. In both data-to-text and headline generation tasks, our model achieves promising results, paving the way for controllable content selection in text generation.

* EMNLP 2019
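
For readers unfamiliar with the phrase "lower bound of the marginal likelihood" in the abstract above, the generic bound obtained from Jensen's inequality is sketched below. The symbols are assumptions introduced only for illustration: $x$ is the source, $y$ the target text, and $\beta$ a discrete latent content selection. This is the textbook bound, not necessarily the exact objective used in the paper.

```latex
\log p(y \mid x)
  = \log \sum_{\beta} p(\beta \mid x)\, p(y \mid x, \beta)
  \;\ge\; \sum_{\beta} p(\beta \mid x)\, \log p(y \mid x, \beta)
  = \mathbb{E}_{\beta \sim p(\beta \mid x)}\bigl[\log p(y \mid x, \beta)\bigr]
```

The inequality holds because the logarithm is concave, so maximizing the right-hand side pushes up a quantity that never exceeds the true log-likelihood.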

Improving Multi-turn Dialogue Modelling with Utterance ReWriter

Jun 14, 2019
Hui Su, Xiaoyu Shen, Rongzhi Zhang, Fei Sun, Pengwei Hu, Cheng Niu, Jie Zhou

Recent research has made impressive progress in single-turn dialogue modelling. In the multi-turn setting, however, current models are still far from satisfactory. One major challenge is the coreference and information omission that occur frequently in daily conversation, which make it hard for machines to understand the speaker's real intention. In this paper, we propose rewriting the human utterance as a preprocessing step to help multi-turn dialogue modelling. Each utterance is first rewritten to recover all coreferred and omitted information. The next processing steps are then performed based on the rewritten utterance. To properly train the utterance rewriter, we collect a new dataset with human annotations and introduce a Transformer-based utterance rewriting architecture using the pointer network. We show that the proposed architecture achieves remarkably good performance on the utterance rewriting task. The trained utterance rewriter can be easily integrated into online chatbots and brings general improvement across different domains.

* Accepted to ACL 2019

NEXUS Network: Connecting the Preceding and the Following in Dialogue Generation

Oct 07, 2018
Hui Su, Xiaoyu Shen, Wenjie Li, Dietrich Klakow

Sequence-to-Sequence (seq2seq) models have become overwhelmingly popular in building end-to-end trainable dialogue systems. Though highly efficient in learning the backbone of human-computer communications, they suffer from the problem of strongly favoring short generic responses. In this paper, we argue that a good response should smoothly connect both the preceding dialogue history and the following conversations. We strengthen this connection through mutual information maximization. To sidestep the non-differentiability of discrete natural language tokens, we introduce an auxiliary continuous code space and map such code space to a learnable prior distribution for generation purpose. Experiments on two dialogue datasets validate the effectiveness of our model, where the generated responses are closely related to the dialogue context and lead to more interactive conversations.

* Accepted by EMNLP2018
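
The abstract above relies on mutual information as the quantity being maximized. As a rough illustration of what that quantity measures, the sketch below computes mutual information exactly for a small discrete joint distribution. It is a toy calculation under the assumption of a fully known joint table, not the paper's estimator, which works with continuous codes; the function name is hypothetical.

```python
import math


def mutual_information(joint):
    """Mutual information I(X; Y) in nats for a discrete joint table.

    joint[i][j] is p(X = i, Y = j); the entries are assumed to sum to 1.
    """
    p_x = [sum(row) for row in joint]          # marginal over rows
    p_y = [sum(col) for col in zip(*joint)]    # marginal over columns
    mi = 0.0
    for i, row in enumerate(joint):
        for j, p in enumerate(row):
            if p > 0.0:  # zero-probability cells contribute nothing
                mi += p * math.log(p / (p_x[i] * p_y[j]))
    return mi


if __name__ == "__main__":
    independent = [[0.25, 0.25], [0.25, 0.25]]
    dependent = [[0.5, 0.0], [0.0, 0.5]]
    print(mutual_information(independent))
    print(mutual_information(dependent))
```

An independent table gives zero, while a table in which one variable determines the other gives the entropy of that variable, which is the sense in which a high value indicates a response tightly tied to its context.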

A Cost-Effective Framework for Preference Elicitation and Aggregation

Jul 07, 2018
Zhibing Zhao, Haoming Li, Junming Wang, Jeffrey Kephart, Nicholas Mattei, Hui Su, Lirong Xia

We propose a cost-effective framework for preference elicitation and aggregation under the Plackett-Luce model with features. Given a budget, our framework iteratively computes the most cost-effective elicitation questions in order to help the agents make a better group decision. We illustrate the viability of the framework with experiments on Amazon Mechanical Turk, which we use to estimate the cost of answering different types of elicitation questions. We compare the prediction accuracy of our framework when adopting various information criteria that evaluate the expected information gain from a question. Our experiments show carefully designed information criteria are much more efficient, i.e., they arrive at the correct answer using fewer queries, than randomly asking questions given the budget constraint.

* 12 pages, 5 figures
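
The Plackett-Luce model named in the abstract above assigns a probability to a full ranking by repeatedly choosing the next item in proportion to its weight among the items not yet ranked. The sketch below illustrates that definition. The linear feature parameterization (score = dot product of a parameter vector and an item's features) is an assumption made here for illustration, and all function names are hypothetical rather than taken from the paper.

```python
import math


def linear_scores(theta, features):
    """Assumed parameterization: one score per item, theta . x_item."""
    return [sum(t * x for t, x in zip(theta, item)) for item in features]


def plackett_luce_log_prob(ranking, scores):
    """Log-probability of a full ranking under Plackett-Luce.

    ranking: item indices, most preferred first.
    scores: real-valued utility per item; an item's weight is exp(score).
    """
    log_prob = 0.0
    remaining = list(ranking)
    for item in ranking:
        # Stable log of the total weight of the items still unranked.
        top = max(scores[j] for j in remaining)
        log_norm = top + math.log(sum(math.exp(scores[j] - top) for j in remaining))
        log_prob += scores[item] - log_norm
        remaining.remove(item)
    return log_prob


if __name__ == "__main__":
    theta = [1.0, -0.5]
    features = [[2.0, 0.0], [1.0, 1.0], [0.0, 2.0]]
    scores = linear_scores(theta, features)
    print(math.exp(plackett_luce_log_prob([0, 1, 2], scores)))
```

Working in log space keeps the computation stable when scores are large, and the probabilities of all possible rankings of the same items sum to one.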

Improving Variational Encoder-Decoders in Dialogue Generation

Feb 06, 2018
Xiaoyu Shen, Hui Su, Shuzi Niu, Vera Demberg

Variational encoder-decoders (VEDs) have shown promising results in dialogue generation. However, the latent variable distributions are usually approximated by a much simpler model than the powerful RNN structure used for encoding and decoding, yielding the KL-vanishing problem and an inconsistent training objective. In this paper, we separate the training step into two phases: the first phase learns to autoencode discrete texts into continuous embeddings, from which the second phase learns to generalize latent representations by reconstructing the encoded embedding. In this case, latent variables are sampled by transforming Gaussian noise through multi-layer perceptrons and are trained with a separate VED model, which has the potential of realizing a much more flexible distribution. We compare our model with currently popular models, and the experiments demonstrate substantial improvement in both metric-based and human evaluations.

* Accepted by AAAI2018
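
The KL-vanishing problem mentioned in the abstract above refers to the KL term of a variational objective collapsing to zero, at which point the latent variable carries no information. The sketch below computes that KL term in closed form for the common case of a diagonal Gaussian posterior and a standard normal prior. The Gaussian form is an assumption chosen for illustration, and the function name is hypothetical; the paper's own latent distribution is described as more flexible than this.

```python
import math


def gaussian_kl_to_standard_normal(mu, log_var):
    """KL( N(mu, diag(exp(log_var))) || N(0, I) ) in nats.

    mu and log_var are equal-length lists, one entry per latent dimension.
    """
    return 0.5 * sum(
        m * m + math.exp(lv) - 1.0 - lv for m, lv in zip(mu, log_var)
    )


if __name__ == "__main__":
    # Posterior identical to the prior: the "vanished" case.
    print(gaussian_kl_to_standard_normal([0.0, 0.0], [0.0, 0.0]))
    # Posterior shifted away from the prior: the latent carries information.
    print(gaussian_kl_to_standard_normal([1.0, -1.0], [0.0, 0.0]))
```

When the first case occurs for every input during training, the decoder is effectively ignoring the latent variable.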

DailyDialog: A Manually Labelled Multi-turn Dialogue Dataset

Oct 11, 2017
Yanran Li, Hui Su, Xiaoyu Shen, Wenjie Li, Ziqiang Cao, Shuzi Niu

We develop a high-quality multi-turn dialog dataset, DailyDialog, which is intriguing in several aspects. The language is human-written and less noisy. The dialogues in the dataset reflect the way we communicate in daily life and cover various topics from everyday life. We also manually label the dataset with communication intention and emotion information. We then evaluate existing approaches on the DailyDialog dataset and hope it benefits the research field of dialog systems.

* accepted by IJCNLP 2017

Towards Cognitive-and-Immersive Systems: Experiments in a Shared (or common) Blockworld Framework

Sep 14, 2017
Matthew Peveler, Biplav Srivastava, Kartik Talamadupula, Naveen Sundar G., Selmer Bringsjord, Hui Su

As computational power has continued to increase, and sensors have become more accurate, the corresponding advent of systems that are cognitive-and-immersive (CAI) has come to pass. CAI systems fall squarely into the intersection of AI with HCI/HRI: such systems interact with and assist the human agents that enter them, in no small part because such systems are infused with AI able to understand and reason about these humans and their beliefs, goals, and plans. We herein explain our approach to engineering CAI systems. We emphasize the capacity of a CAI system to develop and reason over a "theory of the mind" of its human partners. This capacity means that the AI in question has a sophisticated model of the beliefs, knowledge, goals, desires, emotions, etc. of these humans. To accomplish this engineering, a formal framework of very high expressivity is needed. In our case, this framework is a cognitive event calculus, a particular kind of quantified multi-modal logic, and a matching high-expressivity planner. To explain, advance, and to a degree validate our approach, we show that a calculus of this type can enable a CAI system to understand a psychologically tricky scenario couched in what we call the cognitive blockworld framework (CBF). CBF includes machinery able to represent and plan over not merely blocks and actions, but also agents and their mental attitudes about other agents.

* Submitted to IAAI'18

A Conditional Variational Framework for Dialog Generation

Jul 06, 2017
Xiaoyu Shen, Hui Su, Yanran Li, Wenjie Li, Shuzi Niu, Yang Zhao, Akiko Aizawa, Guoping Long

Deep latent variable models have been shown to facilitate response generation for open-domain dialog systems. However, these latent variables are highly randomized, leading to uncontrollable generated responses. In this paper, we propose a framework allowing conditional response generation based on specific attributes. These attributes can be either manually assigned or automatically detected. Moreover, the dialog states for both speakers are modeled separately in order to reflect personal features. We validate this framework on two different scenarios, where the attribute refers to genericness and sentiment states respectively. The experimental results demonstrate the potential of our model, which can generate meaningful responses in accordance with the specified attributes.

* Accepted by ACL2017
