Steve Young

Neural Belief Tracker: Data-Driven Dialogue State Tracking

Apr 21, 2017

Nikola Mrkšić, Diarmuid Ó Séaghdha, Tsung-Hsien Wen, Blaise Thomson, Steve Young

Abstract: One of the core components of modern spoken dialogue systems is the belief tracker, which estimates the user's goal at every step of the dialogue. However, most current approaches have difficulty scaling to larger, more complex dialogue domains. This is due to their dependency on either: a) Spoken Language Understanding models that require large amounts of annotated training data; or b) hand-crafted lexicons for capturing some of the linguistic variation in users' language. We propose a novel Neural Belief Tracking (NBT) framework which overcomes these problems by building on recent advances in representation learning. NBT models reason over pre-trained word vectors, learning to compose them into distributed representations of user utterances and dialogue context. Our evaluation on two datasets shows that this approach surpasses past limitations, matching the performance of state-of-the-art models which rely on hand-crafted semantic lexicons and outperforming them when such lexicons are not provided.

* Accepted as a long paper for the 55th Annual Meeting of the Association for Computational Linguistics (ACL 2017)

Exploiting Sentence and Context Representations in Deep Neural Models for Spoken Language Understanding

Oct 13, 2016

Lina M. Rojas Barahona, Milica Gasic, Nikola Mrkšić, Pei-Hao Su, Stefan Ultes, Tsung-Hsien Wen, Steve Young

Abstract: This paper presents a deep learning architecture for the semantic decoder component of a Statistical Spoken Dialogue System. In a slot-filling dialogue, the semantic decoder predicts the dialogue act and a set of slot-value pairs from a set of n-best hypotheses returned by the Automatic Speech Recognition. Most current models for spoken language understanding assume (i) word-aligned semantic annotations as in sequence taggers and (ii) delexicalisation, or a mapping of input words to domain-specific concepts using heuristics that try to capture morphological variation but that do not scale to other domains nor to language variation (e.g., morphology, synonyms, paraphrasing). In this work the semantic decoder is trained using unaligned semantic annotations and it uses distributed semantic representation learning to overcome the limitations of explicit delexicalisation. The proposed architecture uses a convolutional neural network for the sentence representation and a long short-term memory network for the context representation. Results are presented for the publicly available DSTC2 corpus and an In-car corpus which is similar to DSTC2 but has a significantly higher word error rate (WER).

Dialogue manager domain adaptation using Gaussian process reinforcement learning

Sep 09, 2016

Milica Gasic, Nikola Mrksic, Lina M. Rojas-Barahona, Pei-Hao Su, Stefan Ultes, David Vandyke, Tsung-Hsien Wen, Steve Young

Abstract: Spoken dialogue systems allow humans to interact with machines using natural speech. As such, they have many benefits. By using speech as the primary communication medium, a computer interface can facilitate swift, human-like acquisition of information. In recent years, speech interfaces have become ever more popular, as is evident from the rise of personal assistants such as Siri, Google Now, Cortana and Amazon Alexa. Recently, data-driven machine learning methods have been applied to dialogue modelling and the results achieved for limited-domain applications are comparable to or outperform traditional approaches. Methods based on Gaussian processes are particularly effective as they enable good models to be estimated from limited training data. Furthermore, they provide an explicit estimate of the uncertainty which is particularly useful for reinforcement learning. This article explores the additional steps that are necessary to extend these methods to model multiple dialogue domains. We show that Gaussian process reinforcement learning is an elegant framework that naturally supports a range of methods, including prior knowledge, Bayesian committee machines and multi-agent learning, for facilitating extensible and adaptable dialogue systems.

* Accepted for publication in Computer Speech and Language

Conditional Generation and Snapshot Learning in Neural Dialogue Systems

Jun 10, 2016

Tsung-Hsien Wen, Milica Gasic, Nikola Mrksic, Lina M. Rojas-Barahona, Pei-Hao Su, Stefan Ultes, David Vandyke, Steve Young

Abstract: Recently a variety of LSTM-based conditional language models (LM) have been applied across a range of language generation tasks. In this work we study various model architectures and different ways to represent and aggregate the source information in an end-to-end neural dialogue system framework. A method called snapshot learning is also proposed to facilitate learning from supervised sequential signals by applying a companion cross-entropy objective function to the conditioning vector. The experimental and analytical results demonstrate firstly that competition occurs between the conditioning vector and the LM, and the differing architectures provide different trade-offs between the two. Secondly, the discriminative power and transparency of the conditioning vector is key to providing both model interpretability and better performance. Thirdly, snapshot learning leads to consistent performance improvements independent of which architecture is used.

Continuously Learning Neural Dialogue Management

Jun 08, 2016

Pei-Hao Su, Milica Gasic, Nikola Mrksic, Lina Rojas-Barahona, Stefan Ultes, David Vandyke, Tsung-Hsien Wen, Steve Young

Abstract: We describe a two-step approach for dialogue management in task-oriented spoken dialogue systems. A unified neural network framework is proposed to enable the system to first learn by supervision from a set of dialogue data and then continuously improve its behaviour via reinforcement learning, all using gradient-based algorithms on one single model. The experiments demonstrate the supervised model's effectiveness in the corpus-based evaluation, with user simulation, and with paid human subjects. The use of reinforcement learning further improves the model's performance in both interactive settings, especially under higher-noise conditions.

On-line Active Reward Learning for Policy Optimisation in Spoken Dialogue Systems

Jun 02, 2016

Pei-Hao Su, Milica Gasic, Nikola Mrksic, Lina Rojas-Barahona, Stefan Ultes, David Vandyke, Tsung-Hsien Wen, Steve Young

Abstract: The ability to compute an accurate reward function is essential for optimising a dialogue policy via reinforcement learning. In real-world applications, using explicit user feedback as the reward signal is often unreliable and costly to collect. This problem can be mitigated if the user's intent is known in advance or data is available to pre-train a task success predictor off-line. In practice, neither of these applies to most real-world applications. Here we propose an on-line learning framework whereby the dialogue policy is jointly trained alongside the reward model via active learning with a Gaussian process model. This Gaussian process operates on a continuous space dialogue representation generated in an unsupervised fashion using a recurrent neural network encoder-decoder. The experimental results demonstrate that the proposed framework is able to significantly reduce data annotation costs and mitigate noisy user feedback in dialogue policy learning.

* Accepted as a long paper in ACL 2016

Multi-domain Neural Network Language Generation for Spoken Dialogue Systems

Mar 03, 2016

Tsung-Hsien Wen, Milica Gasic, Nikola Mrksic, Lina M. Rojas-Barahona, Pei-Hao Su, David Vandyke, Steve Young

Abstract: Moving from limited-domain natural language generation (NLG) to open domain is difficult because the number of semantic input combinations grows exponentially with the number of domains. Therefore, it is important to leverage existing resources and exploit similarities between domains to facilitate domain adaptation. In this paper, we propose a procedure to train multi-domain, Recurrent Neural Network-based (RNN) language generators via multiple adaptation steps. In this procedure, a model is first trained on counterfeited data synthesised from an out-of-domain dataset, and then fine-tuned on a small set of in-domain utterances with a discriminative objective function. Corpus-based evaluation results show that the proposed procedure can achieve competitive performance in terms of BLEU score and slot error rate while significantly reducing the data needed to train generators in new, unseen domains. In subjective testing, human judges confirm that the procedure greatly improves generator performance when only a small amount of data is available in the domain.

* Accepted as a long paper in NAACL-HLT 2016

Counter-fitting Word Vectors to Linguistic Constraints

Mar 02, 2016

Nikola Mrkšić, Diarmuid Ó Séaghdha, Blaise Thomson, Milica Gašić, Lina Rojas-Barahona, Pei-Hao Su, David Vandyke, Tsung-Hsien Wen, Steve Young

Abstract: In this work, we present a novel counter-fitting method which injects antonymy and synonymy constraints into vector space representations in order to improve the vectors' capability for judging semantic similarity. Applying this method to publicly available pre-trained word vectors leads to a new state-of-the-art performance on the SimLex-999 dataset. We also show how the method can be used to tailor the word vector space for the downstream task of dialogue state tracking, resulting in robust improvements across different dialogue domains.

* Paper accepted for the 15th Annual Conference of the North American Chapter of the Association for Computational Linguistics (NAACL 2016)
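
As a rough illustration of what injecting synonymy and antonymy constraints into a vector space can mean, the Python sketch below pulls synonym pairs together and pushes antonym pairs apart. It is a toy attract/repel update, not the counter-fitting objective from the paper: the function names, the two-dimensional vectors, the word pairs, the step size, and the rule of stopping the push once cosine similarity is no longer positive are all assumptions made for this example.

```python
import math


def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)


def attract_repel(vectors, synonyms, antonyms, step=0.1, epochs=20):
    """Toy update (hypothetical, not the paper's objective).

    Synonym pairs take a small step towards each other every epoch.
    Antonym pairs take a small step apart, but only while their
    cosine similarity is still positive (an assumed stopping rule).
    """
    vecs = {word: list(vec) for word, vec in vectors.items()}
    for _ in range(epochs):
        for a, b in synonyms:
            diff = [x - y for x, y in zip(vecs[a], vecs[b])]
            vecs[a] = [x - step * d for x, d in zip(vecs[a], diff)]
            vecs[b] = [y + step * d for y, d in zip(vecs[b], diff)]
        for a, b in antonyms:
            if cosine(vecs[a], vecs[b]) > 0:
                diff = [x - y for x, y in zip(vecs[a], vecs[b])]
                vecs[a] = [x + step * d for x, d in zip(vecs[a], diff)]
                vecs[b] = [y - step * d for y, d in zip(vecs[b], diff)]
    return vecs


# Made-up 2-d vectors standing in for pre-trained word vectors.
vectors = {
    "cheap": [1.0, 0.2],
    "inexpensive": [0.6, 0.8],
    "east": [0.9, 0.3],
    "west": [0.8, 0.4],
}
synonyms = [("cheap", "inexpensive")]
antonyms = [("east", "west")]

before_syn = cosine(vectors["cheap"], vectors["inexpensive"])
before_ant = cosine(vectors["east"], vectors["west"])

fitted = attract_repel(vectors, synonyms, antonyms)

after_syn = cosine(fitted["cheap"], fitted["inexpensive"])
after_ant = cosine(fitted["east"], fitted["west"])
```

In this toy run the synonym pair ends up more similar than it started and the antonym pair less similar, which is the qualitative effect the abstract describes.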

Semantically Conditioned LSTM-based Natural Language Generation for Spoken Dialogue Systems

Aug 26, 2015

Tsung-Hsien Wen, Milica Gasic, Nikola Mrksic, Pei-Hao Su, David Vandyke, Steve Young

Abstract: Natural language generation (NLG) is a critical component of spoken dialogue and it has a significant impact both on usability and perceived quality. Most NLG systems in common use employ rules and heuristics and tend to generate rigid and stylised responses without the natural variation of human language. They are also not easily scaled to systems covering multiple domains and languages. This paper presents a statistical language generator based on a semantically controlled Long Short-term Memory (LSTM) structure. The LSTM generator can learn from unaligned data by jointly optimising sentence planning and surface realisation using a simple cross entropy training criterion, and language variation can be easily achieved by sampling from output candidates. With fewer heuristics, an objective evaluation in two differing test domains showed the proposed method improved performance compared to previous methods. Human judges scored the LSTM system higher on informativeness and naturalness and overall preferred it to the other systems.

* To appear in EMNLP 2015

Reward Shaping with Recurrent Neural Networks for Speeding up On-Line Policy Learning in Spoken Dialogue Systems

Aug 18, 2015

Pei-Hao Su, David Vandyke, Milica Gasic, Nikola Mrksic, Tsung-Hsien Wen, Steve Young

Abstract: Statistical spoken dialogue systems have the attractive property of being able to be optimised from data via interactions with real users. However, in the reinforcement learning paradigm the dialogue manager (agent) often requires significant time to explore the state-action space to learn to behave in a desirable manner. This is a critical issue when the system is trained on-line with real users where learning costs are expensive. Reward shaping is one promising technique for addressing these concerns. Here we examine three recurrent neural network (RNN) approaches for providing reward shaping information in addition to the primary (task-orientated) environmental feedback. These RNNs are trained on returns from dialogues generated by a simulated user and attempt to diffuse the overall evaluation of the dialogue back down to the turn level to guide the agent towards good behaviour faster. In both simulated and real user scenarios these RNNs are shown to increase policy learning speed. Importantly, they do not require prior knowledge of the user's goal.

* Accepted for publication in SigDial 2015
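
The abstract does not spell out how the turn-level signal is combined with the environmental reward. One standard option is potential-based reward shaping, in which each turn's reward r is augmented to r + gamma * Phi(s') - Phi(s). The Python sketch below assumes that form. The function name, the per-turn rewards, and the potential values are all hypothetical; in particular, the potentials are hand-picked numbers standing in for whatever a trained RNN would output.

```python
def shaped_rewards(rewards, potentials, gamma=1.0):
    """Apply potential-based shaping to a sequence of turn rewards.

    rewards[t]    : environmental reward received at turn t
    potentials[t] : Phi(s_t), an estimate of how promising the dialogue
                    state is before turn t (one more entry than rewards,
                    because it also covers the final state)
    Returns r_t + gamma * Phi(s_{t+1}) - Phi(s_t) for every turn.
    """
    if len(potentials) != len(rewards) + 1:
        raise ValueError("need one potential per state, including the final one")
    return [
        r + gamma * potentials[t + 1] - potentials[t]
        for t, r in enumerate(rewards)
    ]


# Hypothetical 4-turn dialogue: a per-turn penalty of -1 and a final
# success reward of 20 (made-up numbers for illustration).
rewards = [-1.0, -1.0, -1.0, 20.0]

# Hand-picked potentials standing in for RNN outputs; the initial and
# terminal potentials are both set to 0.
potentials = [0.0, 2.0, 5.0, 9.0, 0.0]

shaped = shaped_rewards(rewards, potentials)
```

With gamma set to 1 and equal initial and terminal potentials, the shaping terms telescope, so the shaped rewards sum to the same dialogue-level return as the original ones while giving the agent a denser turn-level signal.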
