Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Trevor Cohn

Truth Inference at Scale: A Bayesian Model for Adjudicating Highly Redundant Crowd Annotations

Feb 24, 2019
Yuan Li, Benjamin I. P. Rubinstein, Trevor Cohn

Figure 1 for Truth Inference at Scale: A Bayesian Model for Adjudicating Highly Redundant Crowd Annotations

Figure 2 for Truth Inference at Scale: A Bayesian Model for Adjudicating Highly Redundant Crowd Annotations

Figure 3 for Truth Inference at Scale: A Bayesian Model for Adjudicating Highly Redundant Crowd Annotations

Figure 4 for Truth Inference at Scale: A Bayesian Model for Adjudicating Highly Redundant Crowd Annotations

Crowd-sourcing is a cheap and popular means of creating training and evaluation datasets for machine learning, however it poses the problem of `truth inference', as individual workers cannot be wholly trusted to provide reliable annotations. Research into models of annotation aggregation attempts to infer a latent `true' annotation, which has been shown to improve the utility of crowd-sourced data. However, existing techniques beat simple baselines only in low redundancy settings, where the number of annotations per instance is low ($\le 3$), or in situations where workers are unreliable and produce low quality annotations (e.g., through spamming, random, or adversarial behaviours.) As we show, datasets produced by crowd-sourcing are often not of this type: the data is highly redundantly annotated ($\ge 5$ annotations per instance), and the vast majority of workers produce high quality outputs. In these settings, the majority vote heuristic performs very well, and most truth inference models underperform this simple baseline. We propose a novel technique, based on a Bayesian graphical model with conjugate priors, and simple iterative expectation-maximisation inference. Our technique produces competitive performance to the state-of-the-art benchmark methods, and is the only method that significantly outperforms the majority vote heuristic at one-sided level 0.025, shown by significance tests. Moreover, our technique is simple, is implemented in only 50 lines of code, and trains in seconds.

* Accepted at the Web Conference/WWW 2019 (camera ready)

Via

Access Paper or Ask Questions

Multilingual NER Transfer for Low-resource Languages

Feb 01, 2019
Afshin Rahimi, Yuan Li, Trevor Cohn

Figure 1 for Multilingual NER Transfer for Low-resource Languages

Figure 2 for Multilingual NER Transfer for Low-resource Languages

Figure 3 for Multilingual NER Transfer for Low-resource Languages

Figure 4 for Multilingual NER Transfer for Low-resource Languages

In massively multilingual transfer NLP models over many source languages are applied to a low-resource target language. In contrast to most prior work, which use a single model or a small handful, we consider many such models, which raises the critical problem of poor transfer, particularly from distant languages. We propose two techniques for modulating the transfer: one based on unsupervised truth inference, and another using limited supervision in the target language. Evaluating on named entity recognition over 41 languages, we show that our techniques are much more effective than strong baselines, including standard ensembling, and our unsupervised method rivals oracle selection of the single best individual model.

* The first and the second author have equally contributed to this work

Via

Access Paper or Ask Questions

Evaluating the Utility of Hand-crafted Features in Sequence Labelling

Aug 28, 2018
Minghao Wu, Fei Liu, Trevor Cohn

Figure 1 for Evaluating the Utility of Hand-crafted Features in Sequence Labelling

Figure 2 for Evaluating the Utility of Hand-crafted Features in Sequence Labelling

Figure 3 for Evaluating the Utility of Hand-crafted Features in Sequence Labelling

Figure 4 for Evaluating the Utility of Hand-crafted Features in Sequence Labelling

Conventional wisdom is that hand-crafted features are redundant for deep learning models, as they already learn adequate representations of text automatically from corpora. In this work, we test this claim by proposing a new method for exploiting handcrafted features as part of a novel hybrid learning approach, incorporating a feature auto-encoder loss component. We evaluate on the task of named entity recognition (NER), where we show that including manual features for part-of-speech, word shapes and gazetteers can improve the performance of a neural CRF model. We obtain a $F_1$ of 91.89 for the CoNLL-2003 English shared task, which significantly outperforms a collection of highly competitive baseline models. We also present an ablation study showing the importance of auto-encoding, over using features as either inputs or outputs alone, and moreover, show including the autoencoder components reduces training requirements to 60\%, while retaining the same predictive accuracy.

* Accepted to EMNLP 2018 (camera-ready)

Via

Access Paper or Ask Questions

Deep-speare: A Joint Neural Model of Poetic Language, Meter and Rhyme

Jul 10, 2018
Jey Han Lau, Trevor Cohn, Timothy Baldwin, Julian Brooke, Adam Hammond

Figure 1 for Deep-speare: A Joint Neural Model of Poetic Language, Meter and Rhyme

Figure 2 for Deep-speare: A Joint Neural Model of Poetic Language, Meter and Rhyme

Figure 3 for Deep-speare: A Joint Neural Model of Poetic Language, Meter and Rhyme

Figure 4 for Deep-speare: A Joint Neural Model of Poetic Language, Meter and Rhyme

In this paper, we propose a joint architecture that captures language, rhyme and meter for sonnet modelling. We assess the quality of generated poems using crowd and expert judgements. The stress and rhyme models perform very well, as generated poems are largely indistinguishable from human-written poems. Expert evaluation, however, reveals that a vanilla language model captures meter implicitly, and that machine-generated poems still underperform in terms of readability and emotion. Our research shows the importance expert evaluation for poetry generation, and that future research should look beyond rhyme/meter and focus on poetic language.

* Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics (ACL 2018)
* 11 pages; ACL2018

Via

Access Paper or Ask Questions

Graph-to-Sequence Learning using Gated Graph Neural Networks

Jun 26, 2018
Daniel Beck, Gholamreza Haffari, Trevor Cohn

Figure 1 for Graph-to-Sequence Learning using Gated Graph Neural Networks

Figure 2 for Graph-to-Sequence Learning using Gated Graph Neural Networks

Figure 3 for Graph-to-Sequence Learning using Gated Graph Neural Networks

Figure 4 for Graph-to-Sequence Learning using Gated Graph Neural Networks

Many NLP applications can be framed as a graph-to-sequence learning problem. Previous work proposing neural architectures on this setting obtained promising results compared to grammar-based approaches but still rely on linearisation heuristics and/or standard recurrent networks to achieve the best performance. In this work, we propose a new model that encodes the full structural information contained in the graph. Our architecture couples the recently proposed Gated Graph Neural Networks with an input transformation that allows nodes and edges to have their own hidden representations, while tackling the parameter explosion problem present in previous work. Experimental results show that our model outperforms strong baselines in generation from AMR graphs and syntax-based neural machine translation.

* ACL 2018

Via

Access Paper or Ask Questions

A Stochastic Decoder for Neural Machine Translation

May 28, 2018
Philip Schulz, Wilker Aziz, Trevor Cohn

Figure 1 for A Stochastic Decoder for Neural Machine Translation

Figure 2 for A Stochastic Decoder for Neural Machine Translation

Figure 3 for A Stochastic Decoder for Neural Machine Translation

Figure 4 for A Stochastic Decoder for Neural Machine Translation

The process of translation is ambiguous, in that there are typically many valid trans- lations for a given sentence. This gives rise to significant variation in parallel cor- pora, however, most current models of machine translation do not account for this variation, instead treating the prob- lem as a deterministic process. To this end, we present a deep generative model of machine translation which incorporates a chain of latent variables, in order to ac- count for local lexical and syntactic varia- tion in parallel corpora. We provide an in- depth analysis of the pitfalls encountered in variational inference for training deep generative models. Experiments on sev- eral different language pairs demonstrate that the model consistently improves over strong baselines.

* Accepted at ACL 2018

Via

Access Paper or Ask Questions

Content-based Popularity Prediction of Online Petitions Using a Deep Regression Model

May 17, 2018
Shivashankar Subramanian, Timothy Baldwin, Trevor Cohn

Figure 1 for Content-based Popularity Prediction of Online Petitions Using a Deep Regression Model

Figure 2 for Content-based Popularity Prediction of Online Petitions Using a Deep Regression Model

Figure 3 for Content-based Popularity Prediction of Online Petitions Using a Deep Regression Model

Figure 4 for Content-based Popularity Prediction of Online Petitions Using a Deep Regression Model

Online petitions are a cost-effective way for citizens to collectively engage with policy-makers in a democracy. Predicting the popularity of a petition --- commonly measured by its signature count --- based on its textual content has utility for policy-makers as well as those posting the petition. In this work, we model this task using CNN regression with an auxiliary ordinal regression objective. We demonstrate the effectiveness of our proposed approach using UK and US government petition datasets.

* ACL 2018 (camera ready pre-print)

Via

Access Paper or Ask Questions

Narrative Modeling with Memory Chains and Semantic Supervision

May 16, 2018
Fei Liu, Trevor Cohn, Timothy Baldwin

Figure 1 for Narrative Modeling with Memory Chains and Semantic Supervision

Figure 2 for Narrative Modeling with Memory Chains and Semantic Supervision

Figure 3 for Narrative Modeling with Memory Chains and Semantic Supervision

Figure 4 for Narrative Modeling with Memory Chains and Semantic Supervision

Story comprehension requires a deep semantic understanding of the narrative, making it a challenging task. Inspired by previous studies on ROC Story Cloze Test, we propose a novel method, tracking various semantic aspects with external neural memory chains while encouraging each to focus on a particular semantic aspect. Evaluated on the task of story ending prediction, our model demonstrates superior performance to a collection of competitive baselines, setting a new state of the art.

* Accepted to ACL 2018 (camera-ready)

Via

Access Paper or Ask Questions

Towards Robust and Privacy-preserving Text Representations

May 16, 2018
Yitong Li, Timothy Baldwin, Trevor Cohn

Figure 1 for Towards Robust and Privacy-preserving Text Representations

Figure 2 for Towards Robust and Privacy-preserving Text Representations

Figure 3 for Towards Robust and Privacy-preserving Text Representations

Figure 4 for Towards Robust and Privacy-preserving Text Representations

Written text often provides sufficient clues to identify the author, their gender, age, and other important attributes. Consequently, the authorship of training and evaluation corpora can have unforeseen impacts, including differing model performance for different user groups, as well as privacy implications. In this paper, we propose an approach to explicitly obscure important author characteristics at training time, such that representations learned are invariant to these attributes. Evaluating on two tasks, we show that this leads to increased privacy in the learned representations, as well as more robust models to varying evaluation conditions, including out-of-domain corpora.

* Accepted to ACL 2018

Via

Access Paper or Ask Questions

What's in a Domain? Learning Domain-Robust Text Representations using Adversarial Training

May 16, 2018
Yitong Li, Timothy Baldwin, Trevor Cohn

Figure 1 for What's in a Domain? Learning Domain-Robust Text Representations using Adversarial Training

Figure 2 for What's in a Domain? Learning Domain-Robust Text Representations using Adversarial Training

Figure 3 for What's in a Domain? Learning Domain-Robust Text Representations using Adversarial Training

Figure 4 for What's in a Domain? Learning Domain-Robust Text Representations using Adversarial Training

Most real world language problems require learning from heterogenous corpora, raising the problem of learning robust models which generalise well to both similar (in domain) and dissimilar (out of domain) instances to those seen in training. This requires learning an underlying task, while not learning irrelevant signals and biases specific to individual domains. We propose a novel method to optimise both in- and out-of-domain accuracy based on joint learning of a structured neural model with domain-specific and domain-general components, coupled with adversarial training for domain. Evaluating on multi-domain language identification and multi-domain sentiment analysis, we show substantial improvements over standard domain adaptation techniques, and domain-adversarial training.

* Accepted to NAACL 2018

Via

Access Paper or Ask Questions