Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Maria Liakata

GiBERT: Introducing Linguistic Knowledge into BERT through a Lightweight Gated Injection Method

Oct 23, 2020

Nicole Peinelt, Marek Rei, Maria Liakata

Figure 1 for GiBERT: Introducing Linguistic Knowledge into BERT through a Lightweight Gated Injection Method

Figure 2 for GiBERT: Introducing Linguistic Knowledge into BERT through a Lightweight Gated Injection Method

Figure 3 for GiBERT: Introducing Linguistic Knowledge into BERT through a Lightweight Gated Injection Method

Figure 4 for GiBERT: Introducing Linguistic Knowledge into BERT through a Lightweight Gated Injection Method

Abstract:Large pre-trained language models such as BERT have been the driving force behind recent improvements across many NLP tasks. However, BERT is only trained to predict missing words - either behind masks or in the next sentence - and has no knowledge of lexical, syntactic or semantic information beyond what it picks up through unsupervised pre-training. We propose a novel method to explicitly inject linguistic knowledge in the form of word embeddings into any layer of a pre-trained BERT. Our performance improvements on multiple semantic similarity datasets when injecting dependency-based and counter-fitted embeddings indicate that such information is beneficial and currently missing from the original model. Our qualitative analysis shows that counter-fitted embedding injection particularly helps with cases involving synonym pairs.

Via

Access Paper or Ask Questions

QMUL-SDS at CheckThat! 2020: Determining COVID-19 Tweet Check-Worthiness Using an Enhanced CT-BERT with Numeric Expressions

Aug 30, 2020

Rabab Alkhalifa, Theodore Yoong, Elena Kochkina, Arkaitz Zubiaga, Maria Liakata

Figure 1 for QMUL-SDS at CheckThat! 2020: Determining COVID-19 Tweet Check-Worthiness Using an Enhanced CT-BERT with Numeric Expressions

Figure 2 for QMUL-SDS at CheckThat! 2020: Determining COVID-19 Tweet Check-Worthiness Using an Enhanced CT-BERT with Numeric Expressions

Figure 3 for QMUL-SDS at CheckThat! 2020: Determining COVID-19 Tweet Check-Worthiness Using an Enhanced CT-BERT with Numeric Expressions

Figure 4 for QMUL-SDS at CheckThat! 2020: Determining COVID-19 Tweet Check-Worthiness Using an Enhanced CT-BERT with Numeric Expressions

Abstract:This paper describes the participation of the QMUL-SDS team for Task 1 of the CLEF 2020 CheckThat! shared task. The purpose of this task is to determine the check-worthiness of tweets about COVID-19 to identify and prioritise tweets that need fact-checking. The overarching aim is to further support ongoing efforts to protect the public from fake news and help people find reliable information. We describe and analyse the results of our submissions. We show that a CNN using COVID-Twitter-BERT (CT-BERT) enhanced with numeric expressions can effectively boost performance from baseline results. We also show results of training data augmentation with rumours on other topics. Our best system ranked fourth in the task with encouraging outcomes showing potential for improved results in the future.

Via

Access Paper or Ask Questions

Learning to Detect Bipolar Disorder and Borderline Personality Disorder with Language and Speech in Non-Clinical Interviews

Aug 08, 2020

Bo Wang, Yue Wu, Niall Taylor, Terry Lyons, Maria Liakata, Alejo J Nevado-Holgado, Kate E A Saunders

Figure 1 for Learning to Detect Bipolar Disorder and Borderline Personality Disorder with Language and Speech in Non-Clinical Interviews

Figure 2 for Learning to Detect Bipolar Disorder and Borderline Personality Disorder with Language and Speech in Non-Clinical Interviews

Figure 3 for Learning to Detect Bipolar Disorder and Borderline Personality Disorder with Language and Speech in Non-Clinical Interviews

Figure 4 for Learning to Detect Bipolar Disorder and Borderline Personality Disorder with Language and Speech in Non-Clinical Interviews

Abstract:Bipolar disorder (BD) and borderline personality disorder (BPD) are both chronic psychiatric disorders. However, their overlapping symptoms and common comorbidity make it challenging for the clinicians to distinguish the two conditions on the basis of a clinical interview. In this work, we first present a new multi-modal dataset containing interviews involving individuals with BD or BPD being interviewed about a non-clinical topic . We investigate the automatic detection of the two conditions, and demonstrate a good linear classifier that can be learnt using a down-selected set of features from the different aspects of the interviews and a novel approach of summarising these features. Finally, we find that different sets of features characterise BD and BPD, thus providing insights into the difference between the automatic screening of the two conditions.

Via

Access Paper or Ask Questions

Measuring prominence of scientific work in online news as a proxy for impact

Jul 28, 2020

James Ravenscroft, Amanda Clare, Maria Liakata

Figure 1 for Measuring prominence of scientific work in online news as a proxy for impact

Figure 2 for Measuring prominence of scientific work in online news as a proxy for impact

Figure 3 for Measuring prominence of scientific work in online news as a proxy for impact

Figure 4 for Measuring prominence of scientific work in online news as a proxy for impact

Abstract:The impact made by a scientific paper on the work of other academics has many established metrics, including metrics based on citation counts and social media commenting. However, determination of the impact of a scientific paper on the wider society is less well established. For example, is it important for scientific work to be newsworthy? Here we present a new corpus of newspaper articles linked to the scientific papers that they describe. We find that Impact Case studies submitted to the UK Research Excellence Framework (REF) 2014 that refer to scientific papers mentioned in newspaper articles were awarded a higher score in the REF assessment. The papers associated with these case studies also feature prominently in the newspaper articles. We hypothesise that such prominence can be a useful proxy for societal impact. We therefore provide a novel baseline approach for measuring the prominence of scientific papers mentioned within news articles. Our measurement of prominence is based on semantic similarity through a graph-based ranking algorithm. We find that scientific papers with an associated REF case study are more likely to have a stronger prominence score. This supports our hypothesis that linguistic prominence in news can be used to suggest the wider non-academic impact of scientific work.

* 13 pages, 5 figures

Via

Access Paper or Ask Questions

Better Early than Late: Fusing Topics with Word Embeddings for Neural Question Paraphrase Identification

Jul 22, 2020

Nicole Peinelt, Dong Nguyen, Maria Liakata

Figure 1 for Better Early than Late: Fusing Topics with Word Embeddings for Neural Question Paraphrase Identification

Figure 2 for Better Early than Late: Fusing Topics with Word Embeddings for Neural Question Paraphrase Identification

Figure 3 for Better Early than Late: Fusing Topics with Word Embeddings for Neural Question Paraphrase Identification

Figure 4 for Better Early than Late: Fusing Topics with Word Embeddings for Neural Question Paraphrase Identification

Abstract:Question paraphrase identification is a key task in Community Question Answering (CQA) to determine if an incoming question has been previously asked. Many current models use word embeddings to identify duplicate questions, but the use of topic models in feature-engineered systems suggests that they can be helpful for this task, too. We therefore propose two ways of merging topics with word embeddings (early vs. late fusion) in a new neural architecture for question paraphrase identification. Our results show that our system outperforms neural baselines on multiple CQA datasets, while an ablation study highlights the importance of topics and especially early topic-embedding fusion in our architecture.

Via

Access Paper or Ask Questions

Estimating predictive uncertainty for rumour verification models

May 14, 2020

Elena Kochkina, Maria Liakata

Figure 1 for Estimating predictive uncertainty for rumour verification models

Figure 2 for Estimating predictive uncertainty for rumour verification models

Figure 3 for Estimating predictive uncertainty for rumour verification models

Figure 4 for Estimating predictive uncertainty for rumour verification models

Abstract:The inability to correctly resolve rumours circulating online can have harmful real-world consequences. We present a method for incorporating model and data uncertainty estimates into natural language processing models for automatic rumour verification. We show that these estimates can be used to filter out model predictions likely to be erroneous, so that these difficult instances can be prioritised by a human fact-checker. We propose two methods for uncertainty-based instance rejection, supervised and unsupervised. We also show how uncertainty estimates can be used to interpret model performance as a rumour unfolds.

* Accepted to the Annual Conference of the Association for Computational Linguistics (ACL) 2020

Via

Access Paper or Ask Questions

Autoencoding Word Representations through Time for Semantic Change Detection

Apr 28, 2020

Adam Tsakalidis, Maria Liakata

Figure 1 for Autoencoding Word Representations through Time for Semantic Change Detection

Figure 2 for Autoencoding Word Representations through Time for Semantic Change Detection

Figure 3 for Autoencoding Word Representations through Time for Semantic Change Detection

Figure 4 for Autoencoding Word Representations through Time for Semantic Change Detection

Abstract:Semantic change detection concerns the task of identifying words whose meaning has changed over time. The current state-of-the-art detects the level of semantic change in a word by comparing its vector representation in two distinct time periods, without considering its evolution through time. In this work, we propose three variants of sequential models for detecting semantically shifted words, effectively accounting for the changes in the word representations over time, in a temporally sensitive manner. Through extensive experimentation under various settings with both synthetic and real data we showcase the importance of sequential modelling of word vectors through time for detecting the words whose semantics have changed the most. Finally, we take a step towards comparing different approaches in a quantitative manner, demonstrating that the temporal modelling of word representations yields a clear-cut advantage in performance.

Via

Access Paper or Ask Questions

How we do things with words: Analyzing text as social and cultural data

Jul 02, 2019

Dong Nguyen, Maria Liakata, Simon DeDeo, Jacob Eisenstein, David Mimno, Rebekah Tromble, Jane Winters

Abstract:In this article we describe our experiences with computational text analysis. We hope to achieve three primary goals. First, we aim to shed light on thorny issues not always at the forefront of discussions about computational text analysis methods. Second, we hope to provide a set of best practices for working with thick social and cultural concepts. Our guidance is based on our own experiences and is therefore inherently imperfect. Still, given our diversity of disciplinary backgrounds and research practices, we hope to capture a range of ideas and identify commonalities that will resonate for many. And this leads to our final goal: to help promote interdisciplinary collaborations. Interdisciplinary insights and partnerships are essential for realizing the full potential of any computational text analysis that involves social and cultural concepts, and the more we are able to bridge these divides, the more fruitful we believe our work will be.

Via

Access Paper or Ask Questions

RumourEval 2019: Determining Rumour Veracity and Support for Rumours

Sep 18, 2018

Genevieve Gorrell, Kalina Bontcheva, Leon Derczynski, Elena Kochkina, Maria Liakata, Arkaitz Zubiaga

Figure 1 for RumourEval 2019: Determining Rumour Veracity and Support for Rumours

Figure 2 for RumourEval 2019: Determining Rumour Veracity and Support for Rumours

Figure 3 for RumourEval 2019: Determining Rumour Veracity and Support for Rumours

Figure 4 for RumourEval 2019: Determining Rumour Veracity and Support for Rumours

Abstract:This is the proposal for RumourEval-2019, which will run in early 2019 as part of that year's SemEval event. Since the first RumourEval shared task in 2017, interest in automated claim validation has greatly increased, as the dangers of "fake news" have become a mainstream concern. Yet automated support for rumour checking remains in its infancy. For this reason, it is important that a shared task in this area continues to provide a focus for effort, which is likely to increase. We therefore propose a continuation in which the veracity of further rumours is determined, and as previously, supportive of this goal, tweets discussing them are classified according to the stance they take regarding the rumour. Scope is extended compared with the first RumourEval, in that the dataset is substantially expanded to include Reddit as well as Twitter data, and additional languages are also included.

Via

Access Paper or Ask Questions

Nowcasting the Stance of Social Media Users in a Sudden Vote: The Case of the Greek Referendum

Aug 26, 2018

Adam Tsakalidis, Nikolaos Aletras, Alexandra I. Cristea, Maria Liakata

Abstract:Modelling user voting intention in social media is an important research area, with applications in analysing electorate behaviour, online political campaigning and advertising. Previous approaches mainly focus on predicting national general elections, which are regularly scheduled and where data of past results and opinion polls are available. However, there is no evidence of how such models would perform during a sudden vote under time-constrained circumstances. That poses a more challenging task compared to traditional elections, due to its spontaneous nature. In this paper, we focus on the 2015 Greek bailout referendum, aiming to nowcast on a daily basis the voting intention of 2,197 Twitter users. We propose a semi-supervised multiple convolution kernel learning approach, leveraging temporally sensitive text and network information. Our evaluation under a real-time simulation framework demonstrates the effectiveness and robustness of our approach against competitive baselines, achieving a significant 20% increase in F-score compared to solely text-based models.

* Preprint accepted for publication in the ACM International Conference on Information and Knowledge Management (CIKM 2018)

Via

Access Paper or Ask Questions