Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Hannes Schulz

Natural Language Generation in Dialogue using Lexicalized and Delexicalized Data

Apr 21, 2017

Shikhar Sharma, Jing He, Kaheer Suleman, Hannes Schulz, Philip Bachman

Figure 1 for Natural Language Generation in Dialogue using Lexicalized and Delexicalized Data

Figure 2 for Natural Language Generation in Dialogue using Lexicalized and Delexicalized Data

Figure 3 for Natural Language Generation in Dialogue using Lexicalized and Delexicalized Data

Abstract:Natural language generation plays a critical role in spoken dialogue systems. We present a new approach to natural language generation for task-oriented dialogue using recurrent neural networks in an encoder-decoder framework. In contrast to previous work, our model uses both lexicalized and delexicalized components i.e. slot-value pairs for dialogue acts, with slots and corresponding values aligned together. This allows our model to learn from all available data including the slot-value pairing, rather than being restricted to delexicalized slots. We show that this helps our model generate more natural sentences with better grammar. We further improve our model's performance by transferring weights learnt from a pretrained sentence auto-encoder. Human evaluation of our best-performing model indicates that it generates sentences which users find more appealing.

Via

Access Paper or Ask Questions

Frames: A Corpus for Adding Memory to Goal-Oriented Dialogue Systems

Apr 13, 2017

Layla El Asri, Hannes Schulz, Shikhar Sharma, Jeremie Zumer, Justin Harris, Emery Fine, Rahul Mehrotra, Kaheer Suleman

Figure 1 for Frames: A Corpus for Adding Memory to Goal-Oriented Dialogue Systems

Figure 2 for Frames: A Corpus for Adding Memory to Goal-Oriented Dialogue Systems

Figure 3 for Frames: A Corpus for Adding Memory to Goal-Oriented Dialogue Systems

Figure 4 for Frames: A Corpus for Adding Memory to Goal-Oriented Dialogue Systems

Abstract:This paper presents the Frames dataset (Frames is available at http://datasets.maluuba.com/Frames), a corpus of 1369 human-human dialogues with an average of 15 turns per dialogue. We developed this dataset to study the role of memory in goal-oriented dialogue systems. Based on Frames, we introduce a task called frame tracking, which extends state tracking to a setting where several states are tracked simultaneously. We propose a baseline model for this task. We show that Frames can also be used to study memory in dialogue management and information presentation through natural language generation.

Via

Access Paper or Ask Questions

Policy Networks with Two-Stage Training for Dialogue Systems

Sep 12, 2016

Mehdi Fatemi, Layla El Asri, Hannes Schulz, Jing He, Kaheer Suleman

Figure 1 for Policy Networks with Two-Stage Training for Dialogue Systems

Figure 2 for Policy Networks with Two-Stage Training for Dialogue Systems

Figure 3 for Policy Networks with Two-Stage Training for Dialogue Systems

Figure 4 for Policy Networks with Two-Stage Training for Dialogue Systems

Abstract:In this paper, we propose to use deep policy networks which are trained with an advantage actor-critic method for statistically optimised dialogue systems. First, we show that, on summary state and action spaces, deep Reinforcement Learning (RL) outperforms Gaussian Processes methods. Summary state and action spaces lead to good performance but require pre-engineering effort, RL knowledge, and domain expertise. In order to remove the need to define such summary spaces, we show that deep RL can also be trained efficiently on the original state and action spaces. Dialogue systems based on partially observable Markov decision processes are known to require many dialogues to train, which makes them unappealing for practical deployment. We show that a deep RL method based on an actor-critic architecture can exploit a small amount of data very efficiently. Indeed, with only a few hundred dialogues collected with a handcrafted policy, the actor-critic deep learner is considerably bootstrapped from a combination of supervised and batch RL. In addition, convergence to an optimal policy is significantly sped up compared to other deep RL methods initialized on the data with batch RL. All experiments are performed on a restaurant domain derived from the Dialogue State Tracking Challenge 2 (DSTC2) dataset.

* Proceedings of the SIGDIAL 2016 Conference, pages 101--110, Los Angeles, USA, 13-15 September 2016. Association for Computational Linguistics
* SIGDial 2016 (Submitted: May 2016; Accepted: Jun 30, 2016)

Via

Access Paper or Ask Questions