Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Stuart M. Shieber

Harvard University

Challenges in Data-to-Document Generation

Jul 25, 2017

Sam Wiseman, Stuart M. Shieber, Alexander M. Rush

Figure 1 for Challenges in Data-to-Document Generation

Figure 2 for Challenges in Data-to-Document Generation

Figure 3 for Challenges in Data-to-Document Generation

Figure 4 for Challenges in Data-to-Document Generation

Abstract:Recent neural models have shown significant progress on the problem of generating short descriptive texts conditioned on a small number of database records. In this work, we suggest a slightly more difficult data-to-text generation task, and investigate how effective current approaches are on this task. In particular, we introduce a new, large-scale corpus of data records paired with descriptive documents, propose a series of extractive evaluation methods for analyzing performance, and obtain baseline results using current neural generation methods. Experiments show that these models produce fluent text, but fail to convincingly approximate human-generated documents. Moreover, even templated baselines exceed the performance of these neural models on some metrics, though copy- and reconstruction-based extensions lead to noticeable improvements.

* EMNLP 2017

Via

Access Paper or Ask Questions

Word Ordering Without Syntax

Sep 24, 2016

Allen Schmaltz, Alexander M. Rush, Stuart M. Shieber

Figure 1 for Word Ordering Without Syntax

Figure 2 for Word Ordering Without Syntax

Figure 3 for Word Ordering Without Syntax

Figure 4 for Word Ordering Without Syntax

Abstract:Recent work on word ordering has argued that syntactic structure is important, or even required, for effectively recovering the order of a sentence. We find that, in fact, an n-gram language model with a simple heuristic gives strong results on this task. Furthermore, we show that a long short-term memory (LSTM) language model is even more effective at recovering order, with our basic model outperforming a state-of-the-art syntactic model by 11.5 BLEU points. Additional data and larger beams yield further gains, at the expense of training and search time.

* EMNLP 2016

Via

Access Paper or Ask Questions

Sentence-Level Grammatical Error Identification as Sequence-to-Sequence Correction

Apr 16, 2016

Allen Schmaltz, Yoon Kim, Alexander M. Rush, Stuart M. Shieber

Figure 1 for Sentence-Level Grammatical Error Identification as Sequence-to-Sequence Correction

Figure 2 for Sentence-Level Grammatical Error Identification as Sequence-to-Sequence Correction

Figure 3 for Sentence-Level Grammatical Error Identification as Sequence-to-Sequence Correction

Figure 4 for Sentence-Level Grammatical Error Identification as Sequence-to-Sequence Correction

Abstract:We demonstrate that an attention-based encoder-decoder model can be used for sentence-level grammatical error identification for the Automated Evaluation of Scientific Writing (AESW) Shared Task 2016. The attention-based encoder-decoder models can be used for the generation of corrections, in addition to error identification, which is of interest for certain end-user applications. We show that a character-based encoder-decoder model is particularly effective, outperforming other results on the AESW Shared Task on its own, and showing gains over a word-based counterpart. Our final model--a combination of three character-based encoder-decoder models, one word-based encoder-decoder model, and a sentence-level CNN--is the highest performing system on the AESW 2016 binary prediction Shared Task.

* To appear at BEA11, as part of the AESW 2016 Shared Task

Via

Access Paper or Ask Questions

Learning Global Features for Coreference Resolution

Apr 11, 2016

Sam Wiseman, Alexander M. Rush, Stuart M. Shieber

Figure 1 for Learning Global Features for Coreference Resolution

Figure 2 for Learning Global Features for Coreference Resolution

Figure 3 for Learning Global Features for Coreference Resolution

Figure 4 for Learning Global Features for Coreference Resolution

Abstract:There is compelling evidence that coreference prediction would benefit from modeling global information about entity-clusters. Yet, state-of-the-art performance can be achieved with systems treating each mention prediction independently, which we attribute to the inherent difficulty of crafting informative cluster-level features. We instead propose to use recurrent neural networks (RNNs) to learn latent, global representations of entity clusters directly from their mentions. We show that such representations are especially useful for the prediction of pronominal mentions, and can be incorporated into an end-to-end coreference system that outperforms the state of the art without requiring any additional search.

* Accepted to NAACL 2016

Via

Access Paper or Ask Questions

Recognizing Uncertainty in Speech

Mar 09, 2011

Heather Pon-Barry, Stuart M. Shieber

Figure 1 for Recognizing Uncertainty in Speech

Figure 2 for Recognizing Uncertainty in Speech

Figure 3 for Recognizing Uncertainty in Speech

Figure 4 for Recognizing Uncertainty in Speech

Abstract:We address the problem of inferring a speaker's level of certainty based on prosodic information in the speech signal, which has application in speech-based dialogue systems. We show that using phrase-level prosodic features centered around the phrases causing uncertainty, in addition to utterance-level prosodic features, improves our model's level of certainty classification. In addition, our models can be used to predict which phrase a person is uncertain about. These results rely on a novel method for eliciting utterances of varying levels of certainty that allows us to compare the utility of contextually-based feature sets. We elicit level of certainty ratings from both the speakers themselves and a panel of listeners, finding that there is often a mismatch between speakers' internal states and their perceived states, and highlighting the importance of this distinction.

* EURASIP Journal on Advances in Signal Processing, Volume 2011, Article ID 251753, 11 pages
* 11 pages

Via

Access Paper or Ask Questions

Ellipsis and Higher-Order Unification

Mar 08, 1995

Mary Dalrymple, Stuart M. Shieber, Fernando C. N. Pereira

Abstract:We present a new method for characterizing the interpretive possibilities generated by elliptical constructions in natural language. Unlike previous analyses, which postulate ambiguity of interpretation or derivation in the full clause source of the ellipsis, our analysis requires no such hidden ambiguity. Further, the analysis follows relatively directly from an abstract statement of the ellipsis interpretation problem. It predicts correctly a wide range of interactions between ellipsis and other semantic phenomena such as quantifier scope and bound anaphora. Finally, although the analysis itself is stated nonprocedurally, it admits of a direct computational method for generating interpretations.

* Linguistics and Philosophy 14(4):399-452
* 54 pages

Via

Access Paper or Ask Questions

Restricting the Weak-Generative Capacity of Synchronous Tree-Adjoining Grammars

Aug 30, 1994

Stuart M. Shieber

Figure 1 for Restricting the Weak-Generative Capacity of Synchronous Tree-Adjoining Grammars

Figure 2 for Restricting the Weak-Generative Capacity of Synchronous Tree-Adjoining Grammars

Figure 3 for Restricting the Weak-Generative Capacity of Synchronous Tree-Adjoining Grammars

Figure 4 for Restricting the Weak-Generative Capacity of Synchronous Tree-Adjoining Grammars

Abstract:The formalism of synchronous tree-adjoining grammars, a variant of standard tree-adjoining grammars (TAG), was intended to allow the use of TAGs for language transduction in addition to language specification. In previous work, the definition of the transduction relation defined by a synchronous TAG was given by appeal to an iterative rewriting process. The rewriting definition of derivation is problematic in that it greatly extends the expressivity of the formalism and makes the design of parsing algorithms difficult if not impossible. We introduce a simple, natural definition of synchronous tree-adjoining derivation, based on isomorphisms between standard tree-adjoining derivations, that avoids the expressivity and implementability problems of the original rewriting definition. The decrease in expressivity, which would otherwise make the method unusable, is offset by the incorporation of an alternative definition of standard tree-adjoining derivation, previously proposed for completely separate reasons, thereby making it practical to entertain using the natural definition of synchronous derivation. Nonetheless, some remaining problematic cases call for yet more flexibility in the definition; the isomorphism requirement may have to be relaxed. It remains for future research to tune the exact requirements on the allowable mappings.

* Computational Intelligence 10(4):371-385, November 1994
* 21 pages, uses lingmacros.sty, psfig.sty, fullname.sty; minor typographical changes only

Via

Access Paper or Ask Questions

Principles and Implementation of Deductive Parsing

Apr 26, 1994

Stuart M. Shieber, Yves Schabes, Fernando C. N. Pereira

Figure 1 for Principles and Implementation of Deductive Parsing

Figure 2 for Principles and Implementation of Deductive Parsing

Figure 3 for Principles and Implementation of Deductive Parsing

Figure 4 for Principles and Implementation of Deductive Parsing

Abstract:We present a system for generating parsers based directly on the metaphor of parsing as deduction. Parsing algorithms can be represented directly as deduction systems, and a single deduction engine can interpret such deduction systems so as to implement the corresponding parser. The method generalizes easily to parsers for augmented phrase structure formalisms, such as definite-clause grammars and other logic grammar formalisms, and has been used for rapid prototyping of parsing algorithms for a variety of formalisms including variants of tree-adjoining grammars, categorial grammars, and lexicalized context-free grammars.

* 69 pages, includes full Prolog code

Via

Access Paper or Ask Questions

Lessons from a Restricted Turing Test

Apr 04, 1994

Stuart M. Shieber

Abstract:We report on the recent Loebner prize competition inspired by Turing's test of intelligent behavior. The presentation covers the structure of the competition and the outcome of its first instantiation in an actual event, and an analysis of the purpose, design, and appropriateness of such a competition. We argue that the competition has no clear purpose, that its design prevents any useful outcome, and that such a competition is inappropriate given the current level of technology. We then speculate as to suitable alternatives to the Loebner prize.

* 20 pages

Via

Access Paper or Ask Questions

An Alternative Conception of Tree-Adjoining Derivation

Apr 04, 1994

Yves Schabes, Stuart M. Shieber

Figure 1 for An Alternative Conception of Tree-Adjoining Derivation

Figure 2 for An Alternative Conception of Tree-Adjoining Derivation

Figure 3 for An Alternative Conception of Tree-Adjoining Derivation

Figure 4 for An Alternative Conception of Tree-Adjoining Derivation

Abstract:The precise formulation of derivation for tree-adjoining grammars has important ramifications for a wide variety of uses of the formalism, from syntactic analysis to semantic interpretation and statistical language modeling. We argue that the definition of tree-adjoining derivation must be reformulated in order to manifest the proper linguistic dependencies in derivations. The particular proposal is both precisely characterizable through a definition of TAG derivations as equivalence classes of ordered derivation trees, and computationally operational, by virtue of a compilation to linear indexed grammars together with an efficient algorithm for recognition and parsing according to the compiled grammar.

* Computational Linguistics 20(1):91-124
* 33 pages

Via

Access Paper or Ask Questions