Recurrent neural network grammars (RNNGs) are generative models of language which jointly model syntax and surface structure by incrementally generating a syntax tree and sentence in a top-down, left-to-right order. Supervised RNNGs achieve strong language modeling and parsing performance, but require an annotated corpus of parse trees. In this work, we experiment with unsupervised learning of RNNGs. Since directly marginalizing over the space of latent trees is intractable, we instead apply amortized variational inference. To maximize the evidence lower bound, we develop an inference network parameterized as a neural CRF constituency parser. On language modeling, unsupervised RNNGs perform as well as their supervised counterparts on benchmarks in English and Chinese. On constituency grammar induction, they are competitive with recent neural language models that induce tree structures from words through attention mechanisms.
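For concreteness, the objective being maximized is the standard evidence lower bound with latent trees; a generic statement of it (notation here is mine, not necessarily the paper's) is:

```latex
\log p_\theta(x) \;=\; \log \sum_{z \in \mathcal{T}(x)} p_\theta(x, z)
\;\geq\; \mathbb{E}_{q_\phi(z \mid x)}\big[\log p_\theta(x, z)\big]
\;+\; \mathbb{H}\big[q_\phi(z \mid x)\big]
```

Here $x$ is the sentence, $z$ ranges over the trees $\mathcal{T}(x)$ compatible with it, $\theta$ parameterizes the generative RNNG, and $\phi$ the inference network. Parameterizing $q_\phi$ as a CRF parser is what makes the entropy term and samples from $q_\phi$ tractable to compute with dynamic programming.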
Recurrent neural network grammars (RNNGs) are generative models of (tree, string) pairs that rely on neural networks to evaluate derivational choices. Parsing with them using beam search yields a variety of incremental complexity metrics such as word surprisal and parser action count. When used as regressors against human electrophysiological responses to naturalistic text, they derive two amplitude effects: an early peak and a P600-like later peak. By contrast, a non-syntactic neural language model yields no reliable effects. Model comparisons attribute the early peak to syntactic composition within the RNNG. This pattern of results recommends the RNNG+beam search combination as a mechanistic model of the syntactic processing that occurs during normal human language comprehension.
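As a reminder of how such metrics are read off the parser, word surprisal is the negative log conditional probability of the next word, which word-synchronous beam search approximates from the derivations it retains; a schematic version (notation mine) is:

```latex
\mathrm{surprisal}(w_t) \;=\; -\log_2 p(w_t \mid w_{<t})
\;\approx\; -\log_2 \frac{\sum_{d \in B_t} p(d)}{\sum_{d \in B_{t-1}} p(d)}
```

where $B_t$ is the beam of partial derivations after consuming $w_t$ and $p(d)$ is the RNNG's joint probability of a derivation. Parser action count, by contrast, simply counts the derivational steps the parser takes between successive words.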
We describe DyNet, a toolkit for implementing neural network models based on dynamic declaration of network structure. In the static declaration strategy that is used in toolkits like Theano, CNTK, and TensorFlow, the user first defines a computation graph (a symbolic representation of the computation), and then examples are fed into an engine that executes this computation and computes its derivatives. In DyNet's dynamic declaration strategy, computation graph construction is mostly transparent, being implicitly constructed by executing procedural code that computes the network outputs, and the user is free to use different network structures for each input. Dynamic declaration thus facilitates the implementation of more complicated network architectures, and DyNet is specifically designed to allow users to implement their models in a way that is idiomatic in their preferred programming language (C++ or Python). One challenge with dynamic declaration is that because the symbolic computation graph is defined anew for every training example, its construction must have low overhead. To achieve this, DyNet has an optimized C++ backend and lightweight graph representation. Experiments show that DyNet's speeds are faster than or comparable with static declaration toolkits, and significantly faster than Chainer, another dynamic declaration toolkit. DyNet is released open-source under the Apache 2.0 license and available at http://github.com/clab/dynet.
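The contrast with static declaration is easiest to see in code. Below is a minimal training-loop sketch against the DyNet 2.x Python API; the toy model and variable names are illustrative, and `training_data` is an assumed placeholder, not anything from the paper.

```python
import dynet as dy

pc = dy.ParameterCollection()
W = pc.add_parameters((16, 8))        # hidden-layer weights
b = pc.add_parameters(16)
V = pc.add_parameters((2, 16))        # output-layer weights
trainer = dy.SimpleSGDTrainer(pc)

for x, y in training_data:            # x: list of 8 floats, y: class index
    dy.renew_cg()                     # fresh computation graph per example
    # The graph is built implicitly by running ordinary procedural code.
    # (Older DyNet versions require wrapping W, b, V with dy.parameter().)
    h = dy.tanh(W * dy.inputVector(x) + b)
    loss = dy.pickneglogsoftmax(V * h, y)
    loss.value()                      # forward pass
    loss.backward()                   # derivatives on the just-built graph
    trainer.update()
```

Because the graph is rebuilt on every iteration, nothing prevents the network structure from differing per input (for instance, a tree-shaped graph per parse), which is what makes architectures like stack-structured parsers natural to express.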
Recurrent neural network grammars (RNNGs) are a recently proposed probabilistic generative modeling family for natural language. They show state-of-the-art language modeling and parsing performance. We investigate what information they learn, from a linguistic perspective, through various ablations to the model and the data, and by augmenting the model with an attention mechanism (GA-RNNG) to enable closer inspection. We find that explicit modeling of composition is crucial for achieving the best performance. Through the attention mechanism, we find that headedness plays a central role in phrasal representation (with the model's latent attention largely agreeing with predictions made by hand-crafted head rules, albeit with some important differences). By training grammars without nonterminal labels, we find that phrasal representations depend minimally on nonterminals, providing support for the endocentricity hypothesis.
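To make the headedness finding concrete, the numpy sketch below shows the general shape of attention-weighted composition, with the weights over children readable as a soft head assignment. This is an illustration in my own notation, not the paper's exact gated-attention architecture; all names here are hypothetical.

```python
import numpy as np

def softmax(s):
    e = np.exp(s - s.max())
    return e / e.sum()

def compose(children, nt, W):
    """children: (k, d) child vectors; nt: (d,) nonterminal embedding;
    W: (d, d) learned scoring matrix (all illustrative)."""
    scores = children @ (W @ nt)      # one relevance score per child
    alpha = softmax(scores)           # attention weights ~ soft headedness
    return alpha @ children, alpha    # phrase vector and inspectable weights
```

Inspecting `alpha` on closed constituents is the kind of probe that lets the learned weights be compared against hand-crafted head rules.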
We introduce recurrent neural network grammars, probabilistic models of sentences with explicit phrase structure. We explain efficient inference procedures that allow application to both parsing and language modeling. Experiments show that they provide better parsing in English than any single previously published supervised generative model and better language modeling than state-of-the-art sequential RNNs in English and Chinese.
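As an illustration of the top-down, left-to-right generative process, here is the action sequence for a small sentence; the action inventory of NT(X), GEN(w), and REDUCE follows the paper, while the list encoding is mine.

```python
# Top-down generation of "(S (NP The hungry cat) (VP meows))".
actions = [
    "NT(S)",                                 # open an S
    "NT(NP)",                                # open an NP inside it
    "GEN(The)", "GEN(hungry)", "GEN(cat)",   # generate the terminals
    "REDUCE",                                # close NP: compose its children
    "NT(VP)",
    "GEN(meows)",
    "REDUCE",                                # close VP
    "REDUCE",                                # close S
]
```

Each action is scored conditioned on the full derivation history, so the joint probability of the tree and sentence is the product of these action probabilities.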
We introduce two first-order graph-based dependency parsers achieving a new state of the art. The first is a consensus parser built from an ensemble of independently trained greedy LSTM transition-based parsers with different random initializations. We cast this approach as minimum Bayes risk decoding (under the Hamming cost) and argue that weaker consensus within the ensemble is a useful signal of difficulty or ambiguity. The second parser is a "distillation" of the ensemble into a single model. We train the distillation parser using a structured hinge loss objective with a novel cost that incorporates ensemble uncertainty estimates for each possible attachment, thereby avoiding the intractable cross-entropy computations required by applying standard distillation objectives to problems with structured outputs. The first-order distillation parser matches or surpasses the state of the art on English, Chinese, and German.
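The Hamming-cost MBR step has a simple closed form worth spelling out: minimizing expected Hamming cost under the ensemble's empirical distribution reduces to maximizing, over well-formed trees, the total number of ensemble votes each attachment receives (notation below is mine):

```latex
\hat{y} \;=\; \operatorname*{argmin}_{y \in \mathcal{Y}(x)} \;
\mathbb{E}_{y' \sim \hat{p}}\big[\mathrm{Hamming}(y, y')\big]
\;=\; \operatorname*{argmax}_{y \in \mathcal{Y}(x)} \;
\sum_{(h \to m) \in y} \frac{1}{M} \sum_{j=1}^{M} \mathbf{1}\big[(h \to m) \in y_j\big]
```

where $y_1, \dots, y_M$ are the trees predicted by the $M$ ensemble members and $\mathcal{Y}(x)$ is the set of valid dependency trees for $x$. Because the objective decomposes over individual arcs, it can be solved exactly with a first-order graph-based decoder, and arcs with low vote shares are precisely the weak-consensus attachments whose uncertainty the distillation cost exploits.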
We propose a transition-based dependency parser using Recurrent Neural Networks with Long Short-Term Memory (LSTM) units. This extends the feedforward neural network parser of Chen and Manning (2014) and enables modelling of entire sequences of shift/reduce transition decisions. On the Google Web Treebank, our LSTM parser is competitive with the best feedforward parser on overall accuracy and notably achieves more than 3% improvement for long-range dependencies, which has proved difficult for previous transition-based parsers due to error propagation and limited context information. Our findings additionally suggest that dropout regularisation on the embedding layer is crucial to improve the LSTM's generalisation.
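For readers less familiar with the transition-based setup, the sketch below shows an arc-standard shift/reduce loop of the kind being modeled; the transition system follows Chen and Manning (2014), while the function names and the classifier interface are illustrative. The extension described above scores each decision with an LSTM over the entire transition history rather than a feedforward snapshot of the current configuration.

```python
def parse(words, next_action):
    """Arc-standard parsing. next_action is the trained classifier:
    here, an LSTM consuming the whole transition history (illustrative API)."""
    stack, buf, arcs = [], list(range(len(words))), []
    while buf or len(stack) > 1:
        action = next_action(stack, buf)
        if action == "SHIFT":
            stack.append(buf.pop(0))
        elif action == "LEFT-ARC":         # second-top becomes dependent of top
            dep = stack.pop(-2)
            arcs.append((stack[-1], dep))  # (head, dependent)
        else:                              # "RIGHT-ARC": top depends on second-top
            dep = stack.pop()
            arcs.append((stack[-1], dep))
    return arcs
```

Because one early misprediction changes every later configuration, errors propagate down the sequence; conditioning each decision on the full history is what gives the LSTM its advantage on long-range dependencies.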