Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Panupong Pasupat

Mapping Natural Language Commands to Web Elements

Oct 01, 2018

Panupong Pasupat, Tian-Shun Jiang, Evan Zheran Liu, Kelvin Guu, Percy Liang

Figure 1 for Mapping Natural Language Commands to Web Elements

Figure 2 for Mapping Natural Language Commands to Web Elements

Figure 3 for Mapping Natural Language Commands to Web Elements

Figure 4 for Mapping Natural Language Commands to Web Elements

Abstract:The web provides a rich, open-domain environment with textual, structural, and spatial properties. We propose a new task for grounding language in this environment: given a natural language command (e.g., "click on the second article"), choose the correct element on the web page (e.g., a hyperlink or text box). We collected a dataset of over 50,000 commands that capture various phenomena such as functional references (e.g. "find who made this site"), relational reasoning (e.g. "article by john"), and visual reasoning (e.g. "top-most article"). We also implemented and analyzed three baseline models that capture different phenomena present in the dataset.

* EMNLP 2018

Via

Access Paper or Ask Questions

Reinforcement Learning on Web Interfaces Using Workflow-Guided Exploration

Feb 24, 2018

Evan Zheran Liu, Kelvin Guu, Panupong Pasupat, Tianlin Shi, Percy Liang

Figure 1 for Reinforcement Learning on Web Interfaces Using Workflow-Guided Exploration

Figure 2 for Reinforcement Learning on Web Interfaces Using Workflow-Guided Exploration

Figure 3 for Reinforcement Learning on Web Interfaces Using Workflow-Guided Exploration

Figure 4 for Reinforcement Learning on Web Interfaces Using Workflow-Guided Exploration

Abstract:Reinforcement learning (RL) agents improve through trial-and-error, but when reward is sparse and the agent cannot discover successful action sequences, learning stagnates. This has been a notable problem in training deep RL agents to perform web-based tasks, such as booking flights or replying to emails, where a single mistake can ruin the entire sequence of actions. A common remedy is to "warm-start" the agent by pre-training it to mimic expert demonstrations, but this is prone to overfitting. Instead, we propose to constrain exploration using demonstrations. From each demonstration, we induce high-level "workflows" which constrain the allowable actions at each time step to be similar to those in the demonstration (e.g., "Step 1: click on a textbox; Step 2: enter some text"). Our exploration policy then learns to identify successful workflows and samples actions that satisfy these workflows. Workflows prune out bad exploration directions and accelerate the agent's ability to discover rewards. We use our approach to train a novel neural policy designed to handle the semi-structured nature of websites, and evaluate on a suite of web tasks, including the recent World of Bits benchmark. We achieve new state-of-the-art results, and show that workflow-guided exploration improves sample efficiency over behavioral cloning by more than 100x.

* International Conference on Learning Representations (ICLR), 2018

Via

Access Paper or Ask Questions

Macro Grammars and Holistic Triggering for Efficient Semantic Parsing

Aug 31, 2017

Yuchen Zhang, Panupong Pasupat, Percy Liang

Figure 1 for Macro Grammars and Holistic Triggering for Efficient Semantic Parsing

Figure 2 for Macro Grammars and Holistic Triggering for Efficient Semantic Parsing

Figure 3 for Macro Grammars and Holistic Triggering for Efficient Semantic Parsing

Figure 4 for Macro Grammars and Holistic Triggering for Efficient Semantic Parsing

Abstract:To learn a semantic parser from denotations, a learning algorithm must search over a combinatorially large space of logical forms for ones consistent with the annotated denotations. We propose a new online learning algorithm that searches faster as training progresses. The two key ideas are using macro grammars to cache the abstract patterns of useful logical forms found thus far, and holistic triggering to efficiently retrieve the most relevant patterns based on sentence similarity. On the WikiTableQuestions dataset, we first expand the search space of an existing model to improve the state-of-the-art accuracy from 38.7% to 42.7%, and then use macro grammars and holistic triggering to achieve an 11x speedup and an accuracy of 43.7%.

* EMNLP 2017

Via

Access Paper or Ask Questions

From Language to Programs: Bridging Reinforcement Learning and Maximum Marginal Likelihood

Apr 25, 2017

Kelvin Guu, Panupong Pasupat, Evan Zheran Liu, Percy Liang

Figure 1 for From Language to Programs: Bridging Reinforcement Learning and Maximum Marginal Likelihood

Figure 2 for From Language to Programs: Bridging Reinforcement Learning and Maximum Marginal Likelihood

Figure 3 for From Language to Programs: Bridging Reinforcement Learning and Maximum Marginal Likelihood

Figure 4 for From Language to Programs: Bridging Reinforcement Learning and Maximum Marginal Likelihood

Abstract:Our goal is to learn a semantic parser that maps natural language utterances into executable programs when only indirect supervision is available: examples are labeled with the correct execution result, but not the program itself. Consequently, we must search the space of programs for those that output the correct result, while not being misled by spurious programs: incorrect programs that coincidentally output the correct result. We connect two common learning paradigms, reinforcement learning (RL) and maximum marginal likelihood (MML), and then present a new learning algorithm that combines the strengths of both. The new algorithm guards against spurious programs by combining the systematic search traditionally employed in MML with the randomized exploration of RL, and by updating parameters such that probability is spread more evenly across consistent programs. We apply our learning algorithm to a new neural semantic parser and show significant gains over existing state-of-the-art results on a recent context-dependent semantic parsing task.

* Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics (2017)

Via

Access Paper or Ask Questions

Inferring Logical Forms From Denotations

Nov 15, 2016

Panupong Pasupat, Percy Liang

Figure 1 for Inferring Logical Forms From Denotations

Figure 2 for Inferring Logical Forms From Denotations

Figure 3 for Inferring Logical Forms From Denotations

Figure 4 for Inferring Logical Forms From Denotations

Abstract:A core problem in learning semantic parsers from denotations is picking out consistent logical forms--those that yield the correct denotation--from a combinatorially large space. To control the search space, previous work relied on restricted set of rules, which limits expressivity. In this paper, we consider a much more expressive class of logical forms, and show how to use dynamic programming to efficiently represent the complete set of consistent logical forms. Expressivity also introduces many more spurious logical forms which are consistent with the correct denotation but do not represent the meaning of the utterance. To address this, we generate fictitious worlds and use crowdsourced denotations on these worlds to filter out spurious logical forms. On the WikiTableQuestions dataset, we increase the coverage of answerable questions from 53.5% to 76%, and the additional crowdsourced supervision lets us rule out 92.1% of spurious logical forms.

* Published at the Association for Computational Linguistics (ACL) conference, 2016

Via

Access Paper or Ask Questions

Simpler Context-Dependent Logical Forms via Model Projections

Jun 16, 2016

Reginald Long, Panupong Pasupat, Percy Liang

Figure 1 for Simpler Context-Dependent Logical Forms via Model Projections

Figure 2 for Simpler Context-Dependent Logical Forms via Model Projections

Figure 3 for Simpler Context-Dependent Logical Forms via Model Projections

Figure 4 for Simpler Context-Dependent Logical Forms via Model Projections

Abstract:We consider the task of learning a context-dependent mapping from utterances to denotations. With only denotations at training time, we must search over a combinatorially large space of logical forms, which is even larger with context-dependent utterances. To cope with this challenge, we perform successive projections of the full model onto simpler models that operate over equivalence classes of logical forms. Though less expressive, we find that these simpler models are much faster and can be surprisingly effective. Moreover, they can be used to bootstrap the full model. Finally, we collected three new context-dependent semantic parsing datasets, and develop a new left-to-right parser.

* 10 pages, ACL 2016

Via

Access Paper or Ask Questions

Compositional Semantic Parsing on Semi-Structured Tables

Aug 03, 2015

Panupong Pasupat, Percy Liang

Figure 1 for Compositional Semantic Parsing on Semi-Structured Tables

Figure 2 for Compositional Semantic Parsing on Semi-Structured Tables

Figure 3 for Compositional Semantic Parsing on Semi-Structured Tables

Figure 4 for Compositional Semantic Parsing on Semi-Structured Tables

Abstract:Two important aspects of semantic parsing for question answering are the breadth of the knowledge source and the depth of logical compositionality. While existing work trades off one aspect for another, this paper simultaneously makes progress on both fronts through a new task: answering complex questions on semi-structured tables using question-answer pairs as supervision. The central challenge arises from two compounding factors: the broader domain results in an open-ended set of relations, and the deeper compositionality results in a combinatorial explosion in the space of logical forms. We propose a logical-form driven parsing algorithm guided by strong typing constraints and show that it obtains significant improvements over natural baselines. For evaluation, we created a new dataset of 22,033 complex questions on Wikipedia tables, which is made publicly available.

Via

Access Paper or Ask Questions