Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Elena Kochkina

Boosting Low-Resource Biomedical QA via Entity-Aware Masking Strategies

Feb 16, 2021

Gabriele Pergola, Elena Kochkina, Lin Gui, Maria Liakata, Yulan He

Figure 1 for Boosting Low-Resource Biomedical QA via Entity-Aware Masking Strategies

Figure 2 for Boosting Low-Resource Biomedical QA via Entity-Aware Masking Strategies

Figure 3 for Boosting Low-Resource Biomedical QA via Entity-Aware Masking Strategies

Figure 4 for Boosting Low-Resource Biomedical QA via Entity-Aware Masking Strategies

Abstract:Biomedical question-answering (QA) has gained increased attention for its capability to provide users with high-quality information from a vast scientific literature. Although an increasing number of biomedical QA datasets has been recently made available, those resources are still rather limited and expensive to produce. Transfer learning via pre-trained language models (LMs) has been shown as a promising approach to leverage existing general-purpose knowledge. However, finetuning these large models can be costly and time consuming, often yielding limited benefits when adapting to specific themes of specialised domains, such as the COVID-19 literature. To bootstrap further their domain adaptation, we propose a simple yet unexplored approach, which we call biomedical entity-aware masking (BEM). We encourage masked language models to learn entity-centric knowledge based on the pivotal entities characterizing the domain at hand, and employ those entities to drive the LM fine-tuning. The resulting strategy is a downstream process applicable to a wide variety of masked LMs, not requiring additional memory or components in the neural architectures. Experimental results show performance on par with state-of-the-art models on several biomedical QA datasets.

* EACL 2021 - Short Paper - European Chapter of the Association for Computational Linguistics

Via

Access Paper or Ask Questions

QMUL-SDS at CheckThat! 2020: Determining COVID-19 Tweet Check-Worthiness Using an Enhanced CT-BERT with Numeric Expressions

Aug 30, 2020

Rabab Alkhalifa, Theodore Yoong, Elena Kochkina, Arkaitz Zubiaga, Maria Liakata

Figure 1 for QMUL-SDS at CheckThat! 2020: Determining COVID-19 Tweet Check-Worthiness Using an Enhanced CT-BERT with Numeric Expressions

Figure 2 for QMUL-SDS at CheckThat! 2020: Determining COVID-19 Tweet Check-Worthiness Using an Enhanced CT-BERT with Numeric Expressions

Figure 3 for QMUL-SDS at CheckThat! 2020: Determining COVID-19 Tweet Check-Worthiness Using an Enhanced CT-BERT with Numeric Expressions

Figure 4 for QMUL-SDS at CheckThat! 2020: Determining COVID-19 Tweet Check-Worthiness Using an Enhanced CT-BERT with Numeric Expressions

Abstract:This paper describes the participation of the QMUL-SDS team for Task 1 of the CLEF 2020 CheckThat! shared task. The purpose of this task is to determine the check-worthiness of tweets about COVID-19 to identify and prioritise tweets that need fact-checking. The overarching aim is to further support ongoing efforts to protect the public from fake news and help people find reliable information. We describe and analyse the results of our submissions. We show that a CNN using COVID-Twitter-BERT (CT-BERT) enhanced with numeric expressions can effectively boost performance from baseline results. We also show results of training data augmentation with rumours on other topics. Our best system ranked fourth in the task with encouraging outcomes showing potential for improved results in the future.

Via

Access Paper or Ask Questions

Estimating predictive uncertainty for rumour verification models

May 14, 2020

Elena Kochkina, Maria Liakata

Figure 1 for Estimating predictive uncertainty for rumour verification models

Figure 2 for Estimating predictive uncertainty for rumour verification models

Figure 3 for Estimating predictive uncertainty for rumour verification models

Figure 4 for Estimating predictive uncertainty for rumour verification models

Abstract:The inability to correctly resolve rumours circulating online can have harmful real-world consequences. We present a method for incorporating model and data uncertainty estimates into natural language processing models for automatic rumour verification. We show that these estimates can be used to filter out model predictions likely to be erroneous, so that these difficult instances can be prioritised by a human fact-checker. We propose two methods for uncertainty-based instance rejection, supervised and unsupervised. We also show how uncertainty estimates can be used to interpret model performance as a rumour unfolds.

* Accepted to the Annual Conference of the Association for Computational Linguistics (ACL) 2020

Via

Access Paper or Ask Questions

Cost-Sensitive BERT for Generalisable Sentence Classification with Imbalanced Data

Mar 16, 2020

Harish Tayyar Madabushi, Elena Kochkina, Michael Castelle

Figure 1 for Cost-Sensitive BERT for Generalisable Sentence Classification with Imbalanced Data

Figure 2 for Cost-Sensitive BERT for Generalisable Sentence Classification with Imbalanced Data

Figure 3 for Cost-Sensitive BERT for Generalisable Sentence Classification with Imbalanced Data

Figure 4 for Cost-Sensitive BERT for Generalisable Sentence Classification with Imbalanced Data

Abstract:The automatic identification of propaganda has gained significance in recent years due to technological and social changes in the way news is generated and consumed. That this task can be addressed effectively using BERT, a powerful new architecture which can be fine-tuned for text classification tasks, is not surprising. However, propaganda detection, like other tasks that deal with news documents and other forms of decontextualized social communication (e.g. sentiment analysis), inherently deals with data whose categories are simultaneously imbalanced and dissimilar. We show that BERT, while capable of handling imbalanced classes with no additional data augmentation, does not generalise well when the training and test data are sufficiently dissimilar (as is often the case with news sources, whose topics evolve over time). We show how to address this problem by providing a statistical measure of similarity between datasets and a method of incorporating cost-weighting into BERT when the training and test sets are dissimilar. We test these methods on the Propaganda Techniques Corpus (PTC) and achieve the second-highest score on sentence-level propaganda classification.

* NLP4IF 2019

Via

Access Paper or Ask Questions

RumourEval 2019: Determining Rumour Veracity and Support for Rumours

Sep 18, 2018

Genevieve Gorrell, Kalina Bontcheva, Leon Derczynski, Elena Kochkina, Maria Liakata, Arkaitz Zubiaga

Figure 1 for RumourEval 2019: Determining Rumour Veracity and Support for Rumours

Figure 2 for RumourEval 2019: Determining Rumour Veracity and Support for Rumours

Figure 3 for RumourEval 2019: Determining Rumour Veracity and Support for Rumours

Figure 4 for RumourEval 2019: Determining Rumour Veracity and Support for Rumours

Abstract:This is the proposal for RumourEval-2019, which will run in early 2019 as part of that year's SemEval event. Since the first RumourEval shared task in 2017, interest in automated claim validation has greatly increased, as the dangers of "fake news" have become a mainstream concern. Yet automated support for rumour checking remains in its infancy. For this reason, it is important that a shared task in this area continues to provide a focus for effort, which is likely to increase. We therefore propose a continuation in which the veracity of further rumours is determined, and as previously, supportive of this goal, tweets discussing them are classified according to the stance they take regarding the rumour. Scope is extended compared with the first RumourEval, in that the dataset is substantially expanded to include Reddit as well as Twitter data, and additional languages are also included.

Via

Access Paper or Ask Questions

All-in-one: Multi-task Learning for Rumour Verification

Jun 10, 2018

Elena Kochkina, Maria Liakata, Arkaitz Zubiaga

Figure 1 for All-in-one: Multi-task Learning for Rumour Verification

Figure 2 for All-in-one: Multi-task Learning for Rumour Verification

Figure 3 for All-in-one: Multi-task Learning for Rumour Verification

Figure 4 for All-in-one: Multi-task Learning for Rumour Verification

Abstract:Automatic resolution of rumours is a challenging task that can be broken down into smaller components that make up a pipeline, including rumour detection, rumour tracking and stance classification, leading to the final outcome of determining the veracity of a rumour. In previous work, these steps in the process of rumour verification have been developed as separate components where the output of one feeds into the next. We propose a multi-task learning approach that allows joint training of the main and auxiliary tasks, improving the performance of rumour verification. We examine the connection between the dataset properties and the outcomes of the multi-task learning models used.

Via

Access Paper or Ask Questions

Discourse-Aware Rumour Stance Classification in Social Media Using Sequential Classifiers

Dec 06, 2017

Arkaitz Zubiaga, Elena Kochkina, Maria Liakata, Rob Procter, Michal Lukasik, Kalina Bontcheva, Trevor Cohn, Isabelle Augenstein

Figure 1 for Discourse-Aware Rumour Stance Classification in Social Media Using Sequential Classifiers

Figure 2 for Discourse-Aware Rumour Stance Classification in Social Media Using Sequential Classifiers

Figure 3 for Discourse-Aware Rumour Stance Classification in Social Media Using Sequential Classifiers

Figure 4 for Discourse-Aware Rumour Stance Classification in Social Media Using Sequential Classifiers

Abstract:Rumour stance classification, defined as classifying the stance of specific social media posts into one of supporting, denying, querying or commenting on an earlier post, is becoming of increasing interest to researchers. While most previous work has focused on using individual tweets as classifier inputs, here we report on the performance of sequential classifiers that exploit the discourse features inherent in social media interactions or 'conversational threads'. Testing the effectiveness of four sequential classifiers -- Hawkes Processes, Linear-Chain Conditional Random Fields (Linear CRF), Tree-Structured Conditional Random Fields (Tree CRF) and Long Short Term Memory networks (LSTM) -- on eight datasets associated with breaking news stories, and looking at different types of local and contextual features, our work sheds new light on the development of accurate stance classifiers. We show that sequential classifiers that exploit the use of discourse properties in social media conversations while using only local features, outperform non-sequential classifiers. Furthermore, we show that LSTM using a reduced set of features can outperform the other sequential classifiers; this performance is consistent across datasets and across types of stances. To conclude, our work also analyses the different features under study, identifying those that best help characterise and distinguish between stances, such as supporting tweets being more likely to be accompanied by evidence than denying tweets. We also set forth a number of directions for future research.

* Information Processing & Management, Volume 54, Issue 2, March 2018, Pages 273-290

Via

Access Paper or Ask Questions

Turing at SemEval-2017 Task 8: Sequential Approach to Rumour Stance Classification with Branch-LSTM

Apr 24, 2017

Elena Kochkina, Maria Liakata, Isabelle Augenstein

Figure 1 for Turing at SemEval-2017 Task 8: Sequential Approach to Rumour Stance Classification with Branch-LSTM

Figure 2 for Turing at SemEval-2017 Task 8: Sequential Approach to Rumour Stance Classification with Branch-LSTM

Figure 3 for Turing at SemEval-2017 Task 8: Sequential Approach to Rumour Stance Classification with Branch-LSTM

Figure 4 for Turing at SemEval-2017 Task 8: Sequential Approach to Rumour Stance Classification with Branch-LSTM

Abstract:This paper describes team Turing's submission to SemEval 2017 RumourEval: Determining rumour veracity and support for rumours (SemEval 2017 Task 8, Subtask A). Subtask A addresses the challenge of rumour stance classification, which involves identifying the attitude of Twitter users towards the truthfulness of the rumour they are discussing. Stance classification is considered to be an important step towards rumour verification, therefore performing well in this task is expected to be useful in debunking false rumours. In this work we classify a set of Twitter posts discussing rumours into either supporting, denying, questioning or commenting on the underlying rumours. We propose a LSTM-based sequential model that, through modelling the conversational structure of tweets, which achieves an accuracy of 0.784 on the RumourEval test set outperforming all other systems in Subtask A.

* SemEval 2017 RumourEval: Determining rumour veracity and support for rumours (SemEval 2017 Task 8, Subtask A)

Via

Access Paper or Ask Questions

Stance Classification in Rumours as a Sequential Task Exploiting the Tree Structure of Social Media Conversations

Oct 11, 2016

Arkaitz Zubiaga, Elena Kochkina, Maria Liakata, Rob Procter, Michal Lukasik

Figure 1 for Stance Classification in Rumours as a Sequential Task Exploiting the Tree Structure of Social Media Conversations

Figure 2 for Stance Classification in Rumours as a Sequential Task Exploiting the Tree Structure of Social Media Conversations

Figure 3 for Stance Classification in Rumours as a Sequential Task Exploiting the Tree Structure of Social Media Conversations

Figure 4 for Stance Classification in Rumours as a Sequential Task Exploiting the Tree Structure of Social Media Conversations

Abstract:Rumour stance classification, the task that determines if each tweet in a collection discussing a rumour is supporting, denying, questioning or simply commenting on the rumour, has been attracting substantial interest. Here we introduce a novel approach that makes use of the sequence of transitions observed in tree-structured conversation threads in Twitter. The conversation threads are formed by harvesting users' replies to one another, which results in a nested tree-like structure. Previous work addressing the stance classification task has treated each tweet as a separate unit. Here we analyse tweets by virtue of their position in a sequence and test two sequential classifiers, Linear-Chain CRF and Tree CRF, each of which makes different assumptions about the conversational structure. We experiment with eight Twitter datasets, collected during breaking news, and show that exploiting the sequential structure of Twitter conversations achieves significant improvements over the non-sequential methods. Our work is the first to model Twitter conversations as a tree structure in this manner, introducing a novel way of tackling NLP tasks on Twitter conversations.

* COLING 2016

Via

Access Paper or Ask Questions