Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Phil Blunsom

Understanding Grounded Language Learning Agents

Oct 26, 2017

Felix Hill, Karl Moritz Hermann, Phil Blunsom, Stephen Clark

Figure 1 for Understanding Grounded Language Learning Agents

Figure 2 for Understanding Grounded Language Learning Agents

Figure 3 for Understanding Grounded Language Learning Agents

Figure 4 for Understanding Grounded Language Learning Agents

Abstract:Neural network-based systems can now learn to locate the referents of words and phrases in images, answer questions about visual scenes, and even execute symbolic instructions as first-person actors in partially-observable worlds. To achieve this so-called grounded language learning, models must overcome certain well-studied learning challenges that are also fundamental to infants learning their first words. While it is notable that models with no meaningful prior knowledge overcome these learning obstacles, AI researchers and practitioners currently lack a clear understanding of exactly how they do so. Here we address this question as a way of achieving a clearer general understanding of grounded language learning, both to inform future research and to improve confidence in model predictions. For maximum control and generality, we focus on a simple neural network-based language learning agent trained via policy-gradient methods to interpret synthetic linguistic instructions in a simulated 3D world. We apply experimental paradigms from developmental psychology to this agent, exploring the conditions under which established human biases and learning effects emerge. We further propose a novel way to visualise and analyse semantic representation in grounded language learning agents that yields a plausible computational account of the observed effects.

Via

Access Paper or Ask Questions

Program Induction by Rationale Generation : Learning to Solve and Explain Algebraic Word Problems

Oct 23, 2017

Wang Ling, Dani Yogatama, Chris Dyer, Phil Blunsom

Figure 1 for Program Induction by Rationale Generation : Learning to Solve and Explain Algebraic Word Problems

Figure 2 for Program Induction by Rationale Generation : Learning to Solve and Explain Algebraic Word Problems

Figure 3 for Program Induction by Rationale Generation : Learning to Solve and Explain Algebraic Word Problems

Figure 4 for Program Induction by Rationale Generation : Learning to Solve and Explain Algebraic Word Problems

Abstract:Solving algebraic word problems requires executing a series of arithmetic operations---a program---to obtain a final answer. However, since programs can be arbitrarily complicated, inducing them directly from question-answer pairs is a formidable challenge. To make this task more feasible, we solve these problems by generating answer rationales, sequences of natural language and human-readable mathematical expressions that derive the final answer through a series of small steps. Although rationales do not explicitly specify programs, they provide a scaffolding for their structure via intermediate milestones. To evaluate our approach, we have created a new 100,000-sample dataset of questions, answers and rationales. Experimental results show that indirect supervision of program learning via answer rationales is a promising strategy for inducing arithmetic programs.

Via

Access Paper or Ask Questions

Reference-Aware Language Models

Aug 09, 2017

Zichao Yang, Phil Blunsom, Chris Dyer, Wang Ling

Figure 1 for Reference-Aware Language Models

Figure 2 for Reference-Aware Language Models

Figure 3 for Reference-Aware Language Models

Figure 4 for Reference-Aware Language Models

Abstract:We propose a general class of language models that treat reference as an explicit stochastic latent variable. This architecture allows models to create mentions of entities and their attributes by accessing external databases (required by, e.g., dialogue generation and recipe generation) and internal state (required by, e.g. language models which are aware of coreference). This facilitates the incorporation of information that can be accessed in predictable locations in databases or discourse context, even when the targets of the reference may be rare words. Experiments on three tasks shows our model variants based on deterministic attention.

* emnlp camera ready

Via

Access Paper or Ask Questions

Robust Incremental Neural Semantic Graph Parsing

Jul 27, 2017

Jan Buys, Phil Blunsom

Figure 1 for Robust Incremental Neural Semantic Graph Parsing

Figure 2 for Robust Incremental Neural Semantic Graph Parsing

Figure 3 for Robust Incremental Neural Semantic Graph Parsing

Figure 4 for Robust Incremental Neural Semantic Graph Parsing

Abstract:Parsing sentences to linguistically-expressive semantic representations is a key goal of Natural Language Processing. Yet statistical parsing has focused almost exclusively on bilexical dependencies or domain-specific logical forms. We propose a neural encoder-decoder transition-based parser which is the first full-coverage semantic graph parser for Minimal Recursion Semantics (MRS). The model architecture uses stack-based embedding features, predicting graphs jointly with unlexicalized predicates and their token alignments. Our parser is more accurate than attention-based baselines on MRS, and on an additional Abstract Meaning Representation (AMR) benchmark, and GPU batch processing makes it an order of magnitude faster than a high-precision grammar-based parser. Further, the 86.69% Smatch score of our MRS parser is higher than the upper-bound on AMR parsing, making MRS an attractive choice as a semantic representation.

* 12 pages; ACL 2017

Via

Access Paper or Ask Questions

Grounded Language Learning in a Simulated 3D World

Jun 26, 2017

Karl Moritz Hermann, Felix Hill, Simon Green, Fumin Wang, Ryan Faulkner, Hubert Soyer, David Szepesvari, Wojciech Marian Czarnecki, Max Jaderberg, Denis Teplyashin(+4 more)

Figure 1 for Grounded Language Learning in a Simulated 3D World

Figure 2 for Grounded Language Learning in a Simulated 3D World

Figure 3 for Grounded Language Learning in a Simulated 3D World

Figure 4 for Grounded Language Learning in a Simulated 3D World

Abstract:We are increasingly surrounded by artificially intelligent technology that takes decisions and executes actions on our behalf. This creates a pressing need for general means to communicate with, instruct and guide artificial agents, with human language the most compelling means for such communication. To achieve this in a scalable fashion, agents must be able to relate language to the world and to actions; that is, their understanding of language must be grounded and embodied. However, learning grounded language is a notoriously challenging problem in artificial intelligence research. Here we present an agent that learns to interpret language in a simulated 3D environment where it is rewarded for the successful execution of written instructions. Trained via a combination of reinforcement and unsupervised learning, and beginning with minimal prior knowledge, the agent learns to relate linguistic symbols to emergent perceptual representations of its physical surroundings and to pertinent sequences of actions. The agent's comprehension of language extends beyond its prior experience, enabling it to apply familiar language to unfamiliar situations and to interpret entirely novel instructions. Moreover, the speed with which this agent learns new words increases as its semantic knowledge grows. This facility for generalising and bootstrapping semantic knowledge indicates the potential of the present approach for reconciling ambiguous natural language with the complexity of the physical world.

* 16 pages, 8 figures

Via

Access Paper or Ask Questions

Latent Intention Dialogue Models

May 29, 2017

Tsung-Hsien Wen, Yishu Miao, Phil Blunsom, Steve Young

Figure 1 for Latent Intention Dialogue Models

Figure 2 for Latent Intention Dialogue Models

Figure 3 for Latent Intention Dialogue Models

Figure 4 for Latent Intention Dialogue Models

Abstract:Developing a dialogue agent that is capable of making autonomous decisions and communicating by natural language is one of the long-term goals of machine learning research. Traditional approaches either rely on hand-crafting a small state-action set for applying reinforcement learning that is not scalable or constructing deterministic models for learning dialogue sentences that fail to capture natural conversational variability. In this paper, we propose a Latent Intention Dialogue Model (LIDM) that employs a discrete latent variable to learn underlying dialogue intentions in the framework of neural variational inference. In a goal-oriented dialogue scenario, these latent intentions can be interpreted as actions guiding the generation of machine responses, which can be further refined autonomously by reinforcement learning. The experimental evaluation of LIDM shows that the model out-performs published benchmarks for both corpus-based and human evaluation, demonstrating the effectiveness of discrete latent variable models for learning goal-oriented dialogues.

* Accepted at ICML 2017

Via

Access Paper or Ask Questions

Generative and Discriminative Text Classification with Recurrent Neural Networks

May 26, 2017

Dani Yogatama, Chris Dyer, Wang Ling, Phil Blunsom

Figure 1 for Generative and Discriminative Text Classification with Recurrent Neural Networks

Figure 2 for Generative and Discriminative Text Classification with Recurrent Neural Networks

Figure 3 for Generative and Discriminative Text Classification with Recurrent Neural Networks

Figure 4 for Generative and Discriminative Text Classification with Recurrent Neural Networks

Abstract:We empirically characterize the performance of discriminative and generative LSTM models for text classification. We find that although RNN-based generative models are more powerful than their bag-of-words ancestors (e.g., they account for conditional dependencies across words in a document), they have higher asymptotic error rates than discriminatively trained RNN models. However we also find that generative models approach their asymptotic error rate more rapidly than their discriminative counterparts---the same pattern that Ng & Jordan (2001) proved holds for linear classification models that make more naive conditional independence assumptions. Building on this finding, we hypothesize that RNN-based generative classification models will be more robust to shifts in the data distribution. This hypothesis is confirmed in a series of experiments in zero-shot and continual learning settings that show that generative models substantially outperform discriminative models.

Via

Access Paper or Ask Questions

Learning to Create and Reuse Words in Open-Vocabulary Neural Language Modeling

Apr 23, 2017

Kazuya Kawakami, Chris Dyer, Phil Blunsom

Figure 1 for Learning to Create and Reuse Words in Open-Vocabulary Neural Language Modeling

Figure 2 for Learning to Create and Reuse Words in Open-Vocabulary Neural Language Modeling

Figure 3 for Learning to Create and Reuse Words in Open-Vocabulary Neural Language Modeling

Figure 4 for Learning to Create and Reuse Words in Open-Vocabulary Neural Language Modeling

Abstract:Fixed-vocabulary language models fail to account for one of the most characteristic statistical facts of natural language: the frequent creation and reuse of new word types. Although character-level language models offer a partial solution in that they can create word types not attested in the training corpus, they do not capture the "bursty" distribution of such words. In this paper, we augment a hierarchical LSTM language model that generates sequences of word tokens character by character with a caching mechanism that learns to reuse previously generated words. To validate our model we construct a new open-vocabulary language modeling corpus (the Multilingual Wikipedia Corpus, MWC) from comparable Wikipedia articles in 7 typologically diverse languages and demonstrate the effectiveness of our model across this range of languages.

* ACL 2017

Via

Access Paper or Ask Questions

The Neural Noisy Channel

Mar 06, 2017

Lei Yu, Phil Blunsom, Chris Dyer, Edward Grefenstette, Tomas Kocisky

Abstract:We formulate sequence to sequence transduction as a noisy channel decoding problem and use recurrent neural networks to parameterise the source and channel models. Unlike direct models which can suffer from explaining-away effects during training, noisy channel models must produce outputs that explain their inputs, and their component models can be trained with not only paired training samples but also unpaired samples from the marginal output distribution. Using a latent variable to control how much of the conditioning sequence the channel model needs to read in order to generate a subsequent symbol, we obtain a tractable and effective beam search decoder. Experimental results on abstractive sentence summarisation, morphological inflection, and machine translation show that noisy channel models outperform direct models, and that they significantly benefit from increased amounts of unpaired output data that direct models cannot easily use.

* ICLR 2017

Via

Access Paper or Ask Questions

Learning to Compose Words into Sentences with Reinforcement Learning

Nov 28, 2016

Dani Yogatama, Phil Blunsom, Chris Dyer, Edward Grefenstette, Wang Ling

Figure 1 for Learning to Compose Words into Sentences with Reinforcement Learning

Figure 2 for Learning to Compose Words into Sentences with Reinforcement Learning

Figure 3 for Learning to Compose Words into Sentences with Reinforcement Learning

Figure 4 for Learning to Compose Words into Sentences with Reinforcement Learning

Abstract:We use reinforcement learning to learn tree-structured neural networks for computing representations of natural language sentences. In contrast with prior work on tree-structured models in which the trees are either provided as input or predicted using supervision from explicit treebank annotations, the tree structures in this work are optimized to improve performance on a downstream task. Experiments demonstrate the benefit of learning task-specific composition orders, outperforming both sequential encoders and recursive encoders based on treebank annotations. We analyze the induced trees and show that while they discover some linguistically intuitive structures (e.g., noun phrases, simple verb phrases), they are different than conventional English syntactic structures.

Via

Access Paper or Ask Questions