Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Wen-tau Yih

An Imitation Game for Learning Semantic Parsers from User Interaction

May 02, 2020
Ziyu Yao, Yiqi Tang, Wen-tau Yih, Huan Sun, Yu Su

Figure 1 for An Imitation Game for Learning Semantic Parsers from User Interaction

Figure 2 for An Imitation Game for Learning Semantic Parsers from User Interaction

Figure 3 for An Imitation Game for Learning Semantic Parsers from User Interaction

Figure 4 for An Imitation Game for Learning Semantic Parsers from User Interaction

Despite the widely successful applications, bootstrapping and fine-tuning semantic parsers are still a tedious process with challenges such as costly data annotation and privacy risks. In this paper, we suggest an alternative, human-in-the-loop methodology for learning semantic parsers directly from users. A semantic parser should be introspective of its uncertainties and prompt for user demonstration when uncertain. In doing so it also gets to imitate the user behavior and continue improving itself autonomously with the hope that eventually it may become as good as the user in interpreting their questions. To combat the sparsity of demonstration, we propose a novel annotation-efficient imitation learning algorithm, which iteratively collects new datasets by mixing demonstrated states and confident predictions and re-trains the semantic parser in a Dataset Aggregation fashion (Ross et al., 2011). We provide a theoretical analysis of its cost bound and also empirically demonstrate its promising performance on the text-to-SQL problem.

* 17 pages, 6 figures

Via

Access Paper or Ask Questions

Dense Passage Retrieval for Open-Domain Question Answering

May 02, 2020
Vladimir Karpukhin, Barlas Oğuz, Sewon Min, Patrick Lewis, Ledell Wu, Sergey Edunov, Danqi Chen, Wen-tau Yih

Figure 1 for Dense Passage Retrieval for Open-Domain Question Answering

Figure 2 for Dense Passage Retrieval for Open-Domain Question Answering

Figure 3 for Dense Passage Retrieval for Open-Domain Question Answering

Figure 4 for Dense Passage Retrieval for Open-Domain Question Answering

Open-domain question answering relies on efficient passage retrieval to select candidate contexts, where traditional sparse vector space models, such as TF-IDF or BM25, are the de facto method. In this work, we show that retrieval can be practically implemented using dense representations alone, where embeddings are learned from a small number of questions and passages by a simple dual-encoder framework. When evaluated on a wide range of open-domain QA datasets, our dense retriever outperforms a strong Lucene-BM25 system largely by 9%-19% absolute in terms of top-20 passage retrieval accuracy, and helps our end-to-end QA system establish new state-of-the-art on multiple open-domain QA benchmarks.

* corrected typos in Table 3; add a paragraph in Sec. 6.2

Via

Access Paper or Ask Questions

Unsupervised Question Decomposition for Question Answering

Feb 22, 2020
Ethan Perez, Patrick Lewis, Wen-tau Yih, Kyunghyun Cho, Douwe Kiela

Figure 1 for Unsupervised Question Decomposition for Question Answering

Figure 2 for Unsupervised Question Decomposition for Question Answering

Figure 3 for Unsupervised Question Decomposition for Question Answering

Figure 4 for Unsupervised Question Decomposition for Question Answering

We aim to improve question answering (QA) by decomposing hard questions into easier sub-questions that existing QA systems can answer. Since collecting labeled decompositions is cumbersome, we propose an unsupervised approach to produce sub-questions. Specifically, by leveraging >10M questions from Common Crawl, we learn to map from the distribution of multi-hop questions to the distribution of single-hop sub-questions. We answer sub-questions with an off-the-shelf QA model and incorporate the resulting answers in a downstream, multi-hop QA system. On a popular multi-hop QA dataset, HotpotQA, we show large improvements over a strong baseline, especially on adversarial and out-of-domain questions. Our method is generally applicable and automatically learns to decompose questions of different classes, while matching the performance of decomposition methods that rely heavily on hand-engineering and annotation.

Via

Access Paper or Ask Questions

Model-based Interactive Semantic Parsing: A Unified Framework and A Text-to-SQL Case Study

Oct 11, 2019
Ziyu Yao, Yu Su, Huan Sun, Wen-tau Yih

Figure 1 for Model-based Interactive Semantic Parsing: A Unified Framework and A Text-to-SQL Case Study

Figure 2 for Model-based Interactive Semantic Parsing: A Unified Framework and A Text-to-SQL Case Study

Figure 3 for Model-based Interactive Semantic Parsing: A Unified Framework and A Text-to-SQL Case Study

Figure 4 for Model-based Interactive Semantic Parsing: A Unified Framework and A Text-to-SQL Case Study

As a promising paradigm, interactive semantic parsing has shown to improve both semantic parsing accuracy and user confidence in the results. In this paper, we propose a new, unified formulation of the interactive semantic parsing problem, where the goal is to design a model-based intelligent agent. The agent maintains its own state as the current predicted semantic parse, decides whether and where human intervention is needed, and generates a clarification question in natural language. A key part of the agent is a world model: it takes a percept (either an initial question or subsequent feedback from the user) and transitions to a new state. We then propose a simple yet remarkably effective instantiation of our framework, demonstrated on two text-to-SQL datasets (WikiSQL and Spider) with different state-of-the-art base semantic parsers. Compared to an existing interactive semantic parsing approach that treats the base parser as a black box, our approach solicits less user feedback but yields higher run-time accuracy.

* 14 pages, 4 figures, accepted to EMNLP 2019

Via

Access Paper or Ask Questions

Everything Happens for a Reason: Discovering the Purpose of Actions in Procedural Text

Sep 18, 2019
Bhavana Dalvi Mishra, Niket Tandon, Antoine Bosselut, Wen-tau Yih, Peter Clark

Figure 1 for Everything Happens for a Reason: Discovering the Purpose of Actions in Procedural Text

Figure 2 for Everything Happens for a Reason: Discovering the Purpose of Actions in Procedural Text

Figure 3 for Everything Happens for a Reason: Discovering the Purpose of Actions in Procedural Text

Figure 4 for Everything Happens for a Reason: Discovering the Purpose of Actions in Procedural Text

Our goal is to better comprehend procedural text, e.g., a paragraph about photosynthesis, by not only predicting what happens, but why some actions need to happen before others. Our approach builds on a prior process comprehension framework for predicting actions' effects, to also identify subsequent steps that those effects enable. We present our new model (XPAD) that biases effect predictions towards those that (1) explain more of the actions in the paragraph and (2) are more plausible with respect to background knowledge. We also extend an existing benchmark dataset for procedural text comprehension, ProPara, by adding the new task of explaining actions by predicting their dependencies. We find that XPAD significantly outperforms prior systems on this task, while maintaining the performance on the original task in ProPara. The dataset is available at http://data.allenai.org/propara

* Accepted to EMNLP 2019 as a long paper. This revision fixed a typo in an author name in references

Via

Access Paper or Ask Questions

Be Consistent! Improving Procedural Text Comprehension using Label Consistency

Jun 21, 2019
Xinya Du, Bhavana Dalvi Mishra, Niket Tandon, Antoine Bosselut, Wen-tau Yih, Peter Clark, Claire Cardie

Figure 1 for Be Consistent! Improving Procedural Text Comprehension using Label Consistency

Figure 2 for Be Consistent! Improving Procedural Text Comprehension using Label Consistency

Figure 3 for Be Consistent! Improving Procedural Text Comprehension using Label Consistency

Figure 4 for Be Consistent! Improving Procedural Text Comprehension using Label Consistency

Our goal is procedural text comprehension, namely tracking how the properties of entities (e.g., their location) change with time given a procedural text (e.g., a paragraph about photosynthesis, a recipe). This task is challenging as the world is changing throughout the text, and despite recent advances, current systems still struggle with this task. Our approach is to leverage the fact that, for many procedural texts, multiple independent descriptions are readily available, and that predictions from them should be consistent (label consistency). We present a new learning framework that leverages label consistency during training, allowing consistency bias to be built into the model. Evaluation on a standard benchmark dataset for procedural text, ProPara (Dalvi et al., 2018), shows that our approach significantly improves prediction performance (F1) over prior state-of-the-art systems.

* NAACL 2019

Via

Access Paper or Ask Questions

QuaRel: A Dataset and Models for Answering Questions about Qualitative Relationships

Nov 20, 2018
Oyvind Tafjord, Peter Clark, Matt Gardner, Wen-tau Yih, Ashish Sabharwal

Figure 1 for QuaRel: A Dataset and Models for Answering Questions about Qualitative Relationships

Figure 2 for QuaRel: A Dataset and Models for Answering Questions about Qualitative Relationships

Figure 3 for QuaRel: A Dataset and Models for Answering Questions about Qualitative Relationships

Figure 4 for QuaRel: A Dataset and Models for Answering Questions about Qualitative Relationships

Many natural language questions require recognizing and reasoning with qualitative relationships (e.g., in science, economics, and medicine), but are challenging to answer with corpus-based methods. Qualitative modeling provides tools that support such reasoning, but the semantic parsing task of mapping questions into those models has formidable challenges. We present QuaRel, a dataset of diverse story questions involving qualitative relationships that characterize these challenges, and techniques that begin to address them. The dataset has 2771 questions relating 19 different types of quantities. For example, "Jenny observes that the robot vacuum cleaner moves slower on the living room carpet than on the bedroom carpet. Which carpet has more friction?" We contribute (1) a simple and flexible conceptual framework for representing these kinds of questions; (2) the QuaRel dataset, including logical forms, exemplifying the parsing challenges; and (3) two novel models for this task, built as extensions of type-constrained semantic parsing. The first of these models (called QuaSP+) significantly outperforms off-the-shelf tools on QuaRel. The second (QuaSP+Zero) demonstrates zero-shot capability, i.e., the ability to handle new qualitative relationships without requiring additional training data, something not possible with previous models. This work thus makes inroads into answering complex, qualitative questions that require reasoning, and scaling to new relationships at low cost. The dataset and models are available at http://data.allenai.org/quarel.

* 9 pages, AAAI 2019

Via

Access Paper or Ask Questions

FlowQA: Grasping Flow in History for Conversational Machine Comprehension

Oct 06, 2018
Hsin-Yuan Huang, Eunsol Choi, Wen-tau Yih

Figure 1 for FlowQA: Grasping Flow in History for Conversational Machine Comprehension

Figure 2 for FlowQA: Grasping Flow in History for Conversational Machine Comprehension

Figure 3 for FlowQA: Grasping Flow in History for Conversational Machine Comprehension

Figure 4 for FlowQA: Grasping Flow in History for Conversational Machine Comprehension

Conversational machine comprehension requires a deep understanding of the conversation history. To enable traditional, single-turn models to encode the history comprehensively, we introduce Flow, a mechanism that can incorporate intermediate representations generated during the process of answering previous questions, through an alternating parallel processing structure. Compared to shallow approaches that concatenate previous questions/answers as input, Flow integrates the latent semantics of the conversation history more deeply. Our model, FlowQA, shows superior performance on two recently proposed conversational challenges (+7.2% F1 on CoQA and +4.0% on QuAC). The effectiveness of Flow also shows in other tasks. By reducing sequential instruction understanding to conversational machine comprehension, FlowQA outperforms the best models on all three domains in SCONE, with +1.8% to +4.4% improvement in accuracy.

* 11 pages, 4 figures

Via

Access Paper or Ask Questions