Stephen Clark

Generating syntactically varied realisations from AMR graphs

Apr 20, 2018
Kris Cao, Stephen Clark

Generating from Abstract Meaning Representation (AMR) is an underspecified problem, as many syntactic decisions are not specified by the semantic graph. We learn a sequence-to-sequence model that generates possible constituency trees for an AMR graph, and then train another model to generate text realisations conditioned on both an AMR graph and a constituency tree. We show that factorising the model this way lets us effectively use parse information, obtaining competitive BLEU scores on self-generated parses and impressive BLEU scores with oracle parses. We also demonstrate that we can generate meaning-preserving syntactic paraphrases of the same AMR graph.

Emergence of Linguistic Communication from Referential Games with Symbolic and Pixel Input

Apr 11, 2018
Angeliki Lazaridou, Karl Moritz Hermann, Karl Tuyls, Stephen Clark

The ability of algorithms to evolve or learn (compositional) communication protocols has traditionally been studied in the language evolution literature through the use of emergent communication tasks. Here we scale up this research by using contemporary deep learning methods and by training reinforcement-learning neural network agents on referential communication games. We extend previous work, in which agents were trained in symbolic environments, by developing agents which are able to learn from raw pixel data, a more challenging and realistic input representation. We find that the degree of structure found in the input data affects the nature of the emerged protocols, and thereby corroborate the hypothesis that structured compositional language is most likely to emerge when agents perceive the world as being structured.

* To appear at ICLR 2018

Emergent Communication through Negotiation

Apr 11, 2018
Kris Cao, Angeliki Lazaridou, Marc Lanctot, Joel Z Leibo, Karl Tuyls, Stephen Clark

Multi-agent reinforcement learning offers a way to study how communication could emerge in communities of agents needing to solve specific problems. In this paper, we study the emergence of communication in the negotiation environment, a semi-cooperative model of agent interaction. We introduce two communication protocols: one grounded in the semantics of the game, and one which is a priori ungrounded and is a form of cheap talk. We show that self-interested agents can use the pre-grounded communication channel to negotiate fairly, but are unable to effectively use the ungrounded channel. However, prosocial agents do learn to use cheap talk to find an optimal negotiating strategy, suggesting that cooperation is necessary for language to emerge. We also study communication behaviour in a setting where one agent interacts with agents in a community with different levels of prosociality and show how agent identifiability can aid negotiation.

* Published as a conference paper at ICLR 2018

Understanding Grounded Language Learning Agents

Oct 26, 2017
Felix Hill, Karl Moritz Hermann, Phil Blunsom, Stephen Clark

Neural network-based systems can now learn to locate the referents of words and phrases in images, answer questions about visual scenes, and even execute symbolic instructions as first-person actors in partially-observable worlds. To achieve this so-called grounded language learning, models must overcome certain well-studied learning challenges that are also fundamental to infants learning their first words. While it is notable that models with no meaningful prior knowledge overcome these learning obstacles, AI researchers and practitioners currently lack a clear understanding of exactly how they do so. Here we address this question as a way of achieving a clearer general understanding of grounded language learning, both to inform future research and to improve confidence in model predictions. For maximum control and generality, we focus on a simple neural network-based language learning agent trained via policy-gradient methods to interpret synthetic linguistic instructions in a simulated 3D world. We apply experimental paradigms from developmental psychology to this agent, exploring the conditions under which established human biases and learning effects emerge. We further propose a novel way to visualise and analyse semantic representation in grounded language learning agents that yields a plausible computational account of the observed effects.

Jointly Learning Sentence Embeddings and Syntax with Unsupervised Tree-LSTMs

May 25, 2017
Jean Maillard, Stephen Clark, Dani Yogatama

We introduce a neural network that represents sentences by composing their words according to induced binary parse trees. We use Tree-LSTM as our composition function, applied along a tree structure found by a fully differentiable natural language chart parser. Our model simultaneously optimises both the composition function and the parser, thus eliminating the need for externally-provided parse trees which are normally required for Tree-LSTM. It can therefore be seen as a tree-based RNN that is unsupervised with respect to the parse trees. As it is fully differentiable, our model is easily trained with an off-the-shelf gradient descent method and backpropagation. We demonstrate that it achieves better performance compared to various supervised Tree-LSTM architectures on a textual entailment task and a reverse dictionary task.
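
As a rough illustration of the idea in this abstract, the sketch below fills a CKY-style chart in which every cell is a softmax-weighted mixture of the compositions obtained at each split point, so no hard parse decision is ever made. It is a minimal sketch under stated assumptions, not the authors' implementation: `compose` is a simple stand-in for a Tree-LSTM cell, and `scorer`, `soft_chart_encode`, and the toy vectors are hypothetical names and values chosen for illustration.

```python
import math


def compose(left, right):
    # Stand-in for a Tree-LSTM cell (assumption): elementwise tanh of the sum.
    return [math.tanh(l + r) for l, r in zip(left, right)]


def softmax(scores):
    # Numerically stable softmax over a list of floats.
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def soft_chart_encode(word_vectors, scorer):
    """Return a sentence vector from a chart whose cells softly mix all split points."""
    n = len(word_vectors)
    # Width-1 cells hold the word vectors themselves.
    chart = {(i, i + 1): vec for i, vec in enumerate(word_vectors)}
    for width in range(2, n + 1):
        for start in range(n - width + 1):
            end = start + width
            # One candidate composition per split point of the span.
            candidates = [
                compose(chart[(start, mid)], chart[(mid, end)])
                for mid in range(start + 1, end)
            ]
            # Score each candidate with a dot product against the scorer vector.
            scores = [sum(u * c for u, c in zip(scorer, cand)) for cand in candidates]
            weights = softmax(scores)
            # The cell is the weighted sum of candidates (differentiable in principle).
            chart[(start, end)] = [
                sum(w * cand[d] for w, cand in zip(weights, candidates))
                for d in range(len(scorer))
            ]
    return chart[(0, n)]


# Toy three-word sentence with 2-dimensional word vectors (made-up values).
words = [[0.1, 0.2], [0.3, -0.1], [-0.2, 0.4]]
sentence_vector = soft_chart_encode(words, scorer=[1.0, -1.0])
```

In a trainable version, the word vectors, the composition function, and the scoring vector would all be parameters updated by gradient descent; here they are fixed so that the chart logic is easy to follow.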

Latent Variable Dialogue Models and their Diversity

Feb 20, 2017
Kris Cao, Stephen Clark

We present a dialogue generation model that directly captures the variability in possible responses to a given input, which reduces the 'boring output' issue of deterministic dialogue models. Experiments show that our model generates more diverse outputs than baseline models, and also generates more consistently acceptable output than sampling from a deterministic encoder-decoder model.

* Accepted at EACL 2017

Virtual Embodiment: A Scalable Long-Term Strategy for Artificial Intelligence Research

Oct 24, 2016
Douwe Kiela, Luana Bulat, Anita L. Vero, Stephen Clark

Meaning has been called the "holy grail" of a variety of scientific disciplines, ranging from linguistics to philosophy, psychology and the neurosciences. The field of Artificial Intelligence (AI) is very much a part of that list: the development of sophisticated natural language semantics is a sine qua non for achieving a level of intelligence comparable to humans. Embodiment theories in cognitive science hold that human semantic representation depends on sensori-motor experience; the abundant evidence that human meaning representation is grounded in the perception of physical reality leads to the conclusion that meaning must depend on a fusion of multiple (perceptual) modalities. Despite this, AI research in general, and its subdisciplines such as computational linguistics and computer vision in particular, have focused primarily on tasks that involve a single modality. Here, we propose virtual embodiment as an alternative, long-term strategy for AI research that is multi-modal in nature and that allows for the kind of scalability required to develop the field coherently and incrementally, in an ethically responsible fashion.

Using Sentence Plausibility to Learn the Semantics of Transitive Verbs

Dec 12, 2014
Tamara Polajnar, Laura Rimell, Stephen Clark

The functional approach to compositional distributional semantics considers transitive verbs to be linear maps that transform the distributional vectors representing nouns into a vector representing a sentence. We conduct an initial investigation that uses a matrix consisting of the parameters of a logistic regression classifier trained on a plausibility task as a transitive verb function. We compare our method to a commonly used corpus-based method for constructing a verb matrix and find that the plausibility training may be more effective for disambiguation tasks.

* Full updated paper for NIPS learning semantics workshop, with some minor errata fixed
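
One way to read the approach in this abstract is as a bilinear logistic regression: the classifier's weights, arranged as a matrix, act as the verb, and the plausibility of a subject-verb-object triple is the sigmoid of subject-transpose times verb times object. The sketch below is a minimal illustration under that assumption, not the paper's actual training setup; the function names, the two-dimensional toy noun vectors, and the hyperparameters are all hypothetical.

```python
import math


def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))


def plausibility(verb, subj, obj):
    # Bilinear score subj^T . verb . obj, squashed into (0, 1).
    score = sum(
        verb[i][j] * subj[i] * obj[j]
        for i in range(len(subj))
        for j in range(len(obj))
    )
    return sigmoid(score)


def train_verb(examples, dim, epochs=200, lr=0.5):
    """Fit a verb matrix with plain stochastic gradient descent on log loss."""
    verb = [[0.0] * dim for _ in range(dim)]
    for _ in range(epochs):
        for subj, obj, label in examples:
            error = plausibility(verb, subj, obj) - label
            # Gradient of the log loss w.r.t. verb[i][j] is error * subj[i] * obj[j].
            for i in range(dim):
                for j in range(dim):
                    verb[i][j] -= lr * error * subj[i] * obj[j]
    return verb


# Toy one-hot noun vectors (made-up): dimension 0 = "animate", dimension 1 = "edible".
dog, bone = [1.0, 0.0], [0.0, 1.0]
# "dog eats bone" is labelled plausible (1); "bone eats dog" implausible (0).
eat = train_verb([(dog, bone, 1.0), (bone, dog, 0.0)], dim=2)
```

After training, `plausibility(eat, dog, bone)` is close to 1 and `plausibility(eat, bone, dog)` is close to 0, and the learned matrix `eat` can be reused as the verb's linear map.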

The Frobenius anatomy of word meanings II: possessive relative pronouns

Jun 18, 2014
Mehrnoosh Sadrzadeh, Stephen Clark, Bob Coecke

Within the categorical compositional distributional model of meaning, we provide semantic interpretations for the subject and object roles of the possessive relative pronoun 'whose'. This is done in terms of Frobenius algebras over compact closed categories. These algebras and their diagrammatic language expose how meanings of words in relative clauses interact with each other. We show how our interpretation is related to Montague-style semantics and provide a truth-theoretic interpretation. We also show how vector spaces provide a concrete interpretation and provide preliminary corpus-based experimental evidence. In a prequel to this paper, we used similar methods and dealt with the case of subject and object relative pronouns.

* 40 pages, Journal of Logic and Computation, Essays dedicated to Roy Dyckhoff on the occasion of his retirement, S. Graham-Lengrand and D. Galmiche (eds.), 2014

The Frobenius anatomy of word meanings I: subject and object relative pronouns

Apr 21, 2014
Mehrnoosh Sadrzadeh, Stephen Clark, Bob Coecke

This paper develops a compositional vector-based semantics of subject and object relative pronouns within a categorical framework. Frobenius algebras are used to formalise the operations required to model the semantics of relative pronouns, including passing information between the relative clause and the modified noun phrase, as well as copying, combining, and discarding parts of the relative clause. We develop two instantiations of the abstract semantics, one based on a truth-theoretic approach and one based on corpus statistics.

* Journal of Logic and Computation, Special Issue: The Incomputable, an Isaac Newton Institute Workshop, 23(6), pp.1293-1317, 2013
* 31 pages
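
In a vector space with a fixed basis, the Frobenius merging operation reduces to elementwise multiplication, which gives the relative-clause semantics a simple concrete form. The sketch below illustrates this under simplifying assumptions: the verb is treated as a plain matrix indexed `[subject][object]` (that is, with the sentence dimension already discarded), and the function names and toy numbers are hypothetical rather than taken from the paper.

```python
def matvec(matrix, vector):
    # Contract the matrix's second (object) index with the vector.
    return [sum(row[j] * vector[j] for j in range(len(vector))) for row in matrix]


def vecmat(vector, matrix):
    # Contract the matrix's first (subject) index with the vector.
    return [
        sum(vector[i] * matrix[i][j] for i in range(len(vector)))
        for j in range(len(matrix[0]))
    ]


def merge(u, v):
    # Frobenius merge on a fixed basis: elementwise product.
    return [a * b for a, b in zip(u, v)]


def subject_relative(head_noun, verb, obj):
    # "head_noun who verb obj": the head noun fills the verb's subject slot.
    return merge(head_noun, matvec(verb, obj))


def object_relative(head_noun, subj, verb):
    # "head_noun whom subj verb": the head noun fills the verb's object slot.
    return merge(head_noun, vecmat(subj, verb))


# Toy 2-dimensional example with made-up values.
head = [1.0, 2.0]
verb = [[1.0, 0.0], [2.0, 3.0]]
subj_clause = subject_relative(head, verb, [1.0, 1.0])  # [1.0, 10.0]
obj_clause = object_relative(head, [1.0, 1.0], verb)    # [3.0, 6.0]
```

The elementwise product acts as a soft intersection: the head noun keeps weight only on the dimensions that the clause also supports.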
