Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Sonal Gupta

Likelihood Ratios and Generative Classifiers for Unsupervised Out-of-Domain Detection In Task Oriented Dialog

Dec 30, 2019

Varun Gangal, Abhinav Arora, Arash Einolghozati, Sonal Gupta

Figure 1 for Likelihood Ratios and Generative Classifiers for Unsupervised Out-of-Domain Detection In Task Oriented Dialog

Figure 2 for Likelihood Ratios and Generative Classifiers for Unsupervised Out-of-Domain Detection In Task Oriented Dialog

Figure 3 for Likelihood Ratios and Generative Classifiers for Unsupervised Out-of-Domain Detection In Task Oriented Dialog

Figure 4 for Likelihood Ratios and Generative Classifiers for Unsupervised Out-of-Domain Detection In Task Oriented Dialog

Abstract:The task of identifying out-of-domain (OOD) input examples directly at test-time has seen renewed interest recently due to increased real world deployment of models. In this work, we focus on OOD detection for natural language sentence inputs to task-based dialog systems. Our findings are three-fold: First, we curate and release ROSTD (Real Out-of-Domain Sentences From Task-oriented Dialog) - a dataset of 4K OOD examples for the publicly available dataset from (Schuster et al. 2019). In contrast to existing settings which synthesize OOD examples by holding out a subset of classes, our examples were authored by annotators with apriori instructions to be out-of-domain with respect to the sentences in an existing dataset. Second, we explore likelihood ratio based approaches as an alternative to currently prevalent paradigms. Specifically, we reformulate and apply these approaches to natural language inputs. We find that they match or outperform the latter on all datasets, with larger improvements on non-artificial OOD benchmarks such as our dataset. Our ablations validate that specifically using likelihood ratios rather than plain likelihood is necessary to discriminate well between OOD and in-domain data. Third, we propose learning a generative classifier and computing a marginal likelihood (ratio) for OOD detection. This allows us to use a principled likelihood while at the same time exploiting training-time labels. We find that this approach outperforms both simple likelihood (ratio) based and other prior approaches. We are hitherto the first to investigate the use of generative classifiers for OOD detection at test-time.

* Accepted for AAAI-2020 Main Track

Via

Access Paper or Ask Questions

Improving Robustness of Task Oriented Dialog Systems

Nov 12, 2019

Arash Einolghozati, Sonal Gupta, Mrinal Mohit, Rushin Shah

Figure 1 for Improving Robustness of Task Oriented Dialog Systems

Figure 2 for Improving Robustness of Task Oriented Dialog Systems

Figure 3 for Improving Robustness of Task Oriented Dialog Systems

Figure 4 for Improving Robustness of Task Oriented Dialog Systems

Abstract:Task oriented language understanding in dialog systems is often modeled using intents (task of a query) and slots (parameters for that task). Intent detection and slot tagging are, in turn, modeled using sentence classification and word tagging techniques respectively. Similar to adversarial attack problems with computer vision models discussed in existing literature, these intent-slot tagging models are often over-sensitive to small variations in input -- predicting different and often incorrect labels when small changes are made to a query, thus reducing their accuracy and reliability. However, evaluating a model's robustness to these changes is harder for language since words are discrete and an automated change (e.g. adding `noise') to a query sometimes changes the meaning and thus labels of a query. In this paper, we first describe how to create an adversarial test set to measure the robustness of these models. Furthermore, we introduce and adapt adversarial training methods as well as data augmentation using back-translation to mitigate these issues. Our experiments show that both techniques improve the robustness of the system substantially and can be combined to yield the best results.

* 3rd Conversational AI Workshop at 33rd Conference on Neural Information Processing Systems (NeurIPS 2019)

Via

Access Paper or Ask Questions

Improving Semantic Parsing for Task Oriented Dialog

Feb 15, 2019

Arash Einolghozati, Panupong Pasupat, Sonal Gupta, Rushin Shah, Mrinal Mohit, Mike Lewis, Luke Zettlemoyer

Figure 1 for Improving Semantic Parsing for Task Oriented Dialog

Figure 2 for Improving Semantic Parsing for Task Oriented Dialog

Figure 3 for Improving Semantic Parsing for Task Oriented Dialog

Figure 4 for Improving Semantic Parsing for Task Oriented Dialog

Abstract:Semantic parsing using hierarchical representations has recently been proposed for task oriented dialog with promising results [Gupta et al 2018]. In this paper, we present three different improvements to the model: contextualized embeddings, ensembling, and pairwise re-ranking based on a language model. We taxonomize the errors possible for the hierarchical representation, such as wrong top intent, missing spans or split spans, and show that the three approaches correct different kinds of errors. The best model combines the three techniques and gives 6.4% better exact match accuracy than the state-of-the-art, with an error reduction of 33%, resulting in a new state-of-the-art result on the Task Oriented Parsing (TOP) dataset.

Via

Access Paper or Ask Questions

PyText: A Seamless Path from NLP research to production

Dec 12, 2018

Ahmed Aly, Kushal Lakhotia, Shicong Zhao, Mrinal Mohit, Barlas Oguz, Abhinav Arora, Sonal Gupta, Christopher Dewan, Stef Nelson-Lindall, Rushin Shah

Figure 1 for PyText: A Seamless Path from NLP research to production

Figure 2 for PyText: A Seamless Path from NLP research to production

Figure 3 for PyText: A Seamless Path from NLP research to production

Figure 4 for PyText: A Seamless Path from NLP research to production

Abstract:We introduce PyText - a deep learning based NLP modeling framework built on PyTorch. PyText addresses the often-conflicting requirements of enabling rapid experimentation and of serving models at scale. It achieves this by providing simple and extensible interfaces for model components, and by using PyTorch's capabilities of exporting models for inference via the optimized Caffe2 execution engine. We report our own experience of migrating experimentation and production workflows to PyText, which enabled us to iterate faster on novel modeling ideas and then seamlessly ship them at industrial scale.

Via

Access Paper or Ask Questions

Cross-lingual Transfer Learning for Multilingual Task Oriented Dialog

Oct 31, 2018

Sebastian Schuster, Sonal Gupta, Rushin Shah, Mike Lewis

Figure 1 for Cross-lingual Transfer Learning for Multilingual Task Oriented Dialog

Figure 2 for Cross-lingual Transfer Learning for Multilingual Task Oriented Dialog

Figure 3 for Cross-lingual Transfer Learning for Multilingual Task Oriented Dialog

Figure 4 for Cross-lingual Transfer Learning for Multilingual Task Oriented Dialog

Abstract:One of the first steps in the utterance interpretation pipeline of many task-oriented conversational AI systems is to identify user intents and the corresponding slots. Neural sequence labeling models have achieved very high accuracy on these tasks when trained on large amounts of training data. However, collecting this data is very time-consuming and therefore it is unfeasible to collect large amounts of data for many languages. For this reason, it is desirable to make use of existing data in a high-resource language to train models in low-resource languages. In this paper, we investigate the performance of three different methods for cross-lingual transfer learning, namely (1) translating the training data, (2) using cross-lingual pre-trained embeddings, and (3) a novel method of using a multilingual machine translation encoder as contextual word representations. We find that given several hundred training examples in the the target language, the latter two methods outperform translating the training data. Further, in very low-resource settings, we find that multilingual contextual word representations give better results than using cross-lingual static embeddings. We release a dataset of around 57k annotated utterances in English (43k), Spanish (8.6k) and Thai (5k) for three task oriented domains at https://fb.me/multilingual_task_oriented_data.

* 10 pages

Via

Access Paper or Ask Questions

Semantic Parsing for Task Oriented Dialog using Hierarchical Representations

Oct 18, 2018

Sonal Gupta, Rushin Shah, Mrinal Mohit, Anuj Kumar, Mike Lewis

Figure 1 for Semantic Parsing for Task Oriented Dialog using Hierarchical Representations

Figure 2 for Semantic Parsing for Task Oriented Dialog using Hierarchical Representations

Figure 3 for Semantic Parsing for Task Oriented Dialog using Hierarchical Representations

Figure 4 for Semantic Parsing for Task Oriented Dialog using Hierarchical Representations

Abstract:Task oriented dialog systems typically first parse user utterances to semantic frames comprised of intents and slots. Previous work on task oriented intent and slot-filling work has been restricted to one intent per query and one slot label per token, and thus cannot model complex compositional requests. Alternative semantic parsing systems have represented queries as logical forms, but these are challenging to annotate and parse. We propose a hierarchical annotation scheme for semantic parsing that allows the representation of compositional queries, and can be efficiently and accurately parsed by standard constituency parsing models. We release a dataset of 44k annotated queries (fb.me/semanticparsingdialog), and show that parsing models outperform sequence-to-sequence approaches on this dataset.

* Conference on Empirical Methods in Natural Language Processing (EMNLP) 2018

Via

Access Paper or Ask Questions