Stefan Ultes

Improving Interaction Quality Estimation with BiLSTMs and the Impact on Dialogue Policy Learning

Jan 21, 2020

Stefan Ultes

Abstract: Learning suitable and well-performing dialogue behaviour in statistical spoken dialogue systems has been a focus of research for many years. While most work based on reinforcement learning employs an objective measure like task success for modelling the reward signal, we use a reward based on user satisfaction estimation. We propose a novel estimator and show that it outperforms all previous estimators while learning temporal dependencies implicitly. Furthermore, we apply this novel user satisfaction estimation model live in simulated experiments where the satisfaction estimation model is trained on one domain and applied in many other domains which cover a similar task. We show that applying this model results in higher estimated satisfaction, similar task success rates and a higher robustness to noise.

* Published at SIGDIAL 2019

Addressing Objects and Their Relations: The Conversational Entity Dialogue Model

Jan 05, 2019

Stefan Ultes, Paweł Budzianowski, Iñigo Casanueva, Lina Rojas-Barahona, Bo-Hsiang Tseng, Yen-Chen Wu, Steve Young, Milica Gašić

Abstract: Statistical spoken dialogue systems usually rely on a single- or multi-domain dialogue model that is restricted in its capabilities of modelling complex dialogue structures, e.g., relations. In this work, we propose a novel dialogue model that is centred around entities and is able to model relations as well as multiple entities of the same type. We demonstrate in a prototype implementation the benefits of relation modelling on the dialogue level and show that a trained policy using these relations outperforms the multi-domain baseline. Furthermore, we show that by modelling the relations on the dialogue level, the system is capable of processing relations present in the user input and even learns to address them in the system response.

* Accepted at SIGDial 2018

Variational Cross-domain Natural Language Generation for Spoken Dialogue Systems

Dec 20, 2018

Bo-Hsiang Tseng, Florian Kreyssig, Pawel Budzianowski, Inigo Casanueva, Yen-Chen Wu, Stefan Ultes, Milica Gasic

Abstract: Cross-domain natural language generation (NLG) is still a difficult task within spoken dialogue modelling. Given a semantic representation provided by the dialogue manager, the language generator should generate sentences that convey the desired information. Traditional template-based generators can produce sentences with all necessary information, but these sentences are not sufficiently diverse. With RNN-based models, the diversity of the generated sentences can be high; however, some information is lost in the process. In this work, we improve an RNN-based generator by considering latent information at the sentence level during generation using the conditional variational autoencoder architecture. We demonstrate that our model outperforms the original RNN-based generator, while yielding highly diverse sentences. In addition, our model performs better when the training data is limited.

* SIGDIAL 2018

MultiWOZ - A Large-Scale Multi-Domain Wizard-of-Oz Dataset for Task-Oriented Dialogue Modelling

Sep 29, 2018

Paweł Budzianowski, Tsung-Hsien Wen, Bo-Hsiang Tseng, Iñigo Casanueva, Stefan Ultes, Osman Ramadan, Milica Gašić

Abstract: Even though machine learning has become the major scene in the dialogue research community, the real breakthrough has been blocked by the scale of data available. To address this fundamental obstacle, we introduce the Multi-Domain Wizard-of-Oz dataset (MultiWOZ), a fully-labeled collection of human-human written conversations spanning multiple domains and topics. At a size of 10k dialogues, it is at least one order of magnitude larger than all previous annotated task-oriented corpora. The contribution of this work, apart from the open-sourced dataset labelled with dialogue belief states and dialogue actions, is two-fold: firstly, a detailed description of the data collection procedure along with a summary of data structure and analysis is provided. The proposed data-collection pipeline is entirely based on crowd-sourcing without the need to hire professional annotators; secondly, a set of benchmark results of belief tracking, dialogue act and response generation is reported, which shows the usability of the data and sets a baseline for future studies.

* Accepted for publication at EMNLP 2018

Deep learning for language understanding of mental health concepts derived from Cognitive Behavioural Therapy

Sep 03, 2018

Lina Rojas-Barahona, Bo-Hsiang Tseng, Yinpei Dai, Clare Mansfield, Osman Ramadan, Stefan Ultes, Michael Crawford, Milica Gasic

Abstract: In recent years, we have seen deep learning and distributed representations of words and sentences make an impact on a number of natural language processing tasks, such as similarity, entailment and sentiment analysis. Here we introduce a new task: understanding of mental health concepts derived from Cognitive Behavioural Therapy (CBT). We define a mental health ontology based on the CBT principles, annotate a large corpus where these phenomena are exhibited and perform understanding using deep learning and distributed representations. Our results show that deep learning models combined with word embeddings or sentence embeddings significantly outperform non-deep-learning models on this difficult task. This understanding module will be an essential component of a statistical dialogue system delivering therapy.

* Accepted for publication at LOUHI 2018: The Ninth International Workshop on Health Text Mining and Information Analysis

Nearly Zero-Shot Learning for Semantic Decoding in Spoken Dialogue Systems

Jun 21, 2018

Lina M. Rojas-Barahona, Stefan Ultes, Pawel Budzianowski, Iñigo Casanueva, Milica Gasic, Bo-Hsiang Tseng, Steve Young

Abstract: This paper presents two ways of dealing with scarce data in semantic decoding using N-Best speech recognition hypotheses. First, we learn features by using a deep learning architecture in which the weights for the unknown and known categories are jointly optimised. Second, an unsupervised method is used for further tuning the weights. Sharing weights injects prior knowledge into unknown categories. The unsupervised tuning (i.e. the risk minimisation) improves the F-Measure when recognising nearly zero-shot data on the DSTC3 corpus. This unsupervised method can be applied subject to two assumptions: the rank of the class marginal is assumed to be known and the class-conditional scores of the classifier are assumed to follow a Gaussian distribution.

A Benchmarking Environment for Reinforcement Learning Based Task Oriented Dialogue Management

Apr 06, 2018

Iñigo Casanueva, Paweł Budzianowski, Pei-Hao Su, Nikola Mrkšić, Tsung-Hsien Wen, Stefan Ultes, Lina Rojas-Barahona, Steve Young, Milica Gašić

Abstract: Dialogue assistants are rapidly becoming an indispensable daily aid. To avoid the significant effort needed to hand-craft the required dialogue flow, the Dialogue Management (DM) module can be cast as a continuous Markov Decision Process (MDP) and trained through Reinforcement Learning (RL). Several RL models have been investigated over recent years. However, the lack of a common benchmarking framework makes it difficult to perform a fair comparison between different models and their capability to generalise to different environments. Therefore, this paper proposes a set of challenging simulated environments for dialogue model development and evaluation. To provide some baselines, we investigate a number of representative parametric algorithms, namely deep reinforcement learning algorithms (DQN, A2C and Natural Actor-Critic), and compare them to a non-parametric model, GP-SARSA. Both the environments and policy models are implemented using the publicly available PyDial toolkit and released on-line, in order to establish a testbed framework for further experiments and to facilitate experimental reproducibility.

* Accepted at the Deep Reinforcement Learning Symposium, 31st Conference on Neural Information Processing Systems (NIPS 2017). Paper updated with minor changes.

Feudal Reinforcement Learning for Dialogue Management in Large Domains

Mar 08, 2018

Iñigo Casanueva, Paweł Budzianowski, Pei-Hao Su, Stefan Ultes, Lina Rojas-Barahona, Bo-Hsiang Tseng, Milica Gašić

Abstract: Reinforcement learning (RL) is a promising approach to solve dialogue policy optimisation. Traditional RL algorithms, however, fail to scale to large domains due to the curse of dimensionality. We propose a novel Dialogue Management architecture, based on Feudal RL, which decomposes the decision into two steps: a first step where a master policy selects a subset of primitive actions, and a second step where a primitive action is chosen from the selected subset. The structural information included in the domain ontology is used to abstract the dialogue state space, taking the decisions at each step using different parts of the abstracted state. This, combined with an information sharing mechanism between slots, increases the scalability to large domains. We show that an implementation of this approach, based on Deep-Q Networks, significantly outperforms previous state of the art in several dialogue domains and environments, without the need for any additional reward signal.

* Accepted as a short paper in NAACL 2018
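
The two-step decision described in the abstract above can be illustrated with a minimal sketch. This is not the authors' implementation: the function name `feudal_select`, the subset and action names, and the value tables are all hypothetical, and the sketch leaves out the state abstraction and the Deep-Q Networks that would produce the value estimates.

```python
from typing import Dict, Tuple


def feudal_select(
    master_q: Dict[str, float],
    sub_q: Dict[str, Dict[str, float]],
) -> Tuple[str, str]:
    """Greedy two-step action selection (illustrative only).

    master_q maps each action subset to an estimated value;
    sub_q maps each subset to the values of its primitive actions.
    """
    # Step 1: the master policy picks the subset with the highest value.
    subset = max(master_q, key=master_q.get)
    # Step 2: a primitive action is chosen only from the selected subset.
    primitives = sub_q[subset]
    action = max(primitives, key=primitives.get)
    return subset, action


# Hypothetical value estimates for a toy domain.
master_q = {"slot_independent": 0.2, "slot_dependent": 0.7}
sub_q = {
    "slot_independent": {"hello": 0.1, "inform": 0.9},
    "slot_dependent": {"request_area": 0.4, "confirm_food": 0.6},
}
choice = feudal_select(master_q, sub_q)
```

In this toy input, "inform" has the highest value overall, yet it is never considered because the master step has already restricted the choice to the other subset. That restriction is what keeps each individual decision small.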

Reward-Balancing for Statistical Spoken Dialogue Systems using Multi-objective Reinforcement Learning

Jul 19, 2017

Stefan Ultes, Paweł Budzianowski, Iñigo Casanueva, Nikola Mrkšić, Lina Rojas-Barahona, Pei-Hao Su, Tsung-Hsien Wen, Milica Gašić, Steve Young

Abstract: Reinforcement learning is widely used for dialogue policy optimization where the reward function often consists of more than one component, e.g., the dialogue success and the dialogue length. In this work, we propose a structured method for finding a good balance between these components by searching for the optimal reward component weighting. To render this search feasible, we use multi-objective reinforcement learning to significantly reduce the number of training dialogues required. We apply our proposed method to find optimized component weights for six domains and compare them to a default baseline.

* Accepted at SIGDial 2017
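
To make the idea of a weighted reward with a success component and a length component concrete, here is a minimal sketch. The linear form, the function name `dialogue_reward`, and the weight values are assumptions for illustration, not taken from the paper, and the sketch does not show the multi-objective search itself.

```python
def dialogue_reward(
    success: bool, num_turns: int, w_success: float, w_length: float
) -> float:
    """Linear combination of two reward components (illustrative only).

    A successful dialogue earns w_success; every turn costs w_length.
    """
    return w_success * float(success) - w_length * num_turns


# Two hypothetical dialogues: a long successful one and a short failed one.
long_success = (True, 12)
short_failure = (False, 3)

# With a high success weight, the successful dialogue scores higher.
high_a = dialogue_reward(*long_success, w_success=20.0, w_length=1.0)
high_b = dialogue_reward(*short_failure, w_success=20.0, w_length=1.0)

# With a low success weight, the short failed dialogue scores higher.
low_a = dialogue_reward(*long_success, w_success=5.0, w_length=1.0)
low_b = dialogue_reward(*short_failure, w_success=5.0, w_length=1.0)
```

The ranking of the two dialogues flips between the two weightings, which shows why the choice of component weights shapes the behaviour a learned policy is rewarded for.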

Sub-domain Modelling for Dialogue Management with Hierarchical Reinforcement Learning

Jul 17, 2017

Paweł Budzianowski, Stefan Ultes, Pei-Hao Su, Nikola Mrkšić, Tsung-Hsien Wen, Iñigo Casanueva, Lina Rojas-Barahona, Milica Gašić

Abstract: Human conversation is inherently complex, often spanning many different topics/domains. This makes policy learning for dialogue systems very challenging. Standard flat reinforcement learning methods do not provide an efficient framework for modelling such dialogues. In this paper, we focus on the under-explored problem of multi-domain dialogue management. First, we propose a new method for hierarchical reinforcement learning using the option framework. Next, we show that the proposed architecture learns faster and arrives at a better policy than the existing flat ones do. Moreover, we show how pretrained policies can be adapted to more complex systems with an additional set of new actions. In doing that, we show that our approach has the potential to facilitate policy optimisation for more sophisticated multi-domain dialogue systems.

* Updated Section 4 and the bibliography
