Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Miguel Ballesteros

Simple Yet Effective Synthetic Dataset Construction for Unsupervised Opinion Summarization

Mar 21, 2023

Ming Shen, Jie Ma, Shuai Wang, Yogarshi Vyas, Kalpit Dixit, Miguel Ballesteros, Yassine Benajiba

Abstract:Opinion summarization provides an important solution for summarizing opinions expressed among a large number of reviews. However, generating aspect-specific and general summaries is challenging due to the lack of annotated data. In this work, we propose two simple yet effective unsupervised approaches to generate both aspect-specific and general opinion summaries by training on synthetic datasets constructed with aspect-related review contents. Our first approach, Seed Words Based Leave-One-Out (SW-LOO), identifies aspect-related portions of reviews simply by exact-matching aspect seed words and outperforms existing methods by 3.4 ROUGE-L points on SPACE and 0.5 ROUGE-1 point on OPOSUM+ for aspect-specific opinion summarization. Our second approach, Natural Language Inference Based Leave-One-Out (NLI-LOO) identifies aspect-related sentences utilizing an NLI model in a more general setting without using seed words and outperforms existing approaches by 1.2 ROUGE-L points on SPACE for aspect-specific opinion summarization and remains competitive on other metrics.

* EACL 2023 Findings

Via

Access Paper or Ask Questions

Dynamic Benchmarking of Masked Language Models on Temporal Concept Drift with Multiple Views

Feb 23, 2023

Katerina Margatina, Shuai Wang, Yogarshi Vyas, Neha Anna John, Yassine Benajiba, Miguel Ballesteros

Figure 1 for Dynamic Benchmarking of Masked Language Models on Temporal Concept Drift with Multiple Views

Figure 2 for Dynamic Benchmarking of Masked Language Models on Temporal Concept Drift with Multiple Views

Figure 3 for Dynamic Benchmarking of Masked Language Models on Temporal Concept Drift with Multiple Views

Figure 4 for Dynamic Benchmarking of Masked Language Models on Temporal Concept Drift with Multiple Views

Abstract:Temporal concept drift refers to the problem of data changing over time. In NLP, that would entail that language (e.g. new expressions, meaning shifts) and factual knowledge (e.g. new concepts, updated facts) evolve over time. Focusing on the latter, we benchmark $11$ pretrained masked language models (MLMs) on a series of tests designed to evaluate the effect of temporal concept drift, as it is crucial that widely used language models remain up-to-date with the ever-evolving factual updates of the real world. Specifically, we provide a holistic framework that (1) dynamically creates temporal test sets of any time granularity (e.g. month, quarter, year) of factual data from Wikidata, (2) constructs fine-grained splits of tests (e.g. updated, new, unchanged facts) to ensure comprehensive analysis, and (3) evaluates MLMs in three distinct ways (single-token probing, multi-token generation, MLM scoring). In contrast to prior work, our framework aims to unveil how robust an MLM is over time and thus to provide a signal in case it has become outdated, by leveraging multiple views of evaluation.

* To appear at EACL 2023. Our code will be available at https://github.com/amazon-science/temporal-robustness

Via

Access Paper or Ask Questions

Novel Chapter Abstractive Summarization using Spinal Tree Aware Sub-Sentential Content Selection

Nov 09, 2022

Hardy Hardy, Miguel Ballesteros, Faisal Ladhak, Muhammad Khalifa, Vittorio Castelli, Kathleen McKeown

Abstract:Summarizing novel chapters is a difficult task due to the input length and the fact that sentences that appear in the desired summaries draw content from multiple places throughout the chapter. We present a pipelined extractive-abstractive approach where the extractive step filters the content that is passed to the abstractive component. Extremely lengthy input also results in a highly skewed dataset towards negative instances for extractive summarization; we thus adopt a margin ranking loss for extraction to encourage separation between positive and negative examples. Our extraction component operates at the constituent level; our approach to this problem enriches the text with spinal tree information which provides syntactic context (in the form of constituents) to the extraction model. We show an improvement of 3.71 Rouge-1 points over best results reported in prior work on an existing novel chapter dataset.

Via

Access Paper or Ask Questions

Instruction Tuning for Few-Shot Aspect-Based Sentiment Analysis

Oct 12, 2022

Siddharth Varia, Shuai Wang, Kishaloy Halder, Robert Vacareanu, Miguel Ballesteros, Yassine Benajiba, Neha Anna John, Rishita Anubhai, Smaranda Muresan, Dan Roth

Figure 1 for Instruction Tuning for Few-Shot Aspect-Based Sentiment Analysis

Figure 2 for Instruction Tuning for Few-Shot Aspect-Based Sentiment Analysis

Figure 3 for Instruction Tuning for Few-Shot Aspect-Based Sentiment Analysis

Figure 4 for Instruction Tuning for Few-Shot Aspect-Based Sentiment Analysis

Abstract:Aspect-based Sentiment Analysis (ABSA) is a fine-grained sentiment analysis task which involves four elements from user-generated texts: aspect term, aspect category, opinion term, and sentiment polarity. Most computational approaches focus on some of the ABSA sub-tasks such as tuple (aspect term, sentiment polarity) or triplet (aspect term, opinion term, sentiment polarity) extraction using either pipeline or joint modeling approaches. Recently, generative approaches have been proposed to extract all four elements as (one or more) quadruplets from text as a single task. In this work, we take a step further and propose a unified framework for solving ABSA, and the associated sub-tasks to improve the performance in few-shot scenarios. To this end, we fine-tune a T5 model with instructional prompts in a multi-task learning fashion covering all the sub-tasks, as well as the entire quadruple prediction task. In experiments with multiple benchmark data sets, we show that the proposed multi-task prompting approach brings performance boost (by absolute $6.75$ F1) in the few-shot learning setting.

Via

Access Paper or Ask Questions

Contrastive Training Improves Zero-Shot Classification of Semi-structured Documents

Oct 11, 2022

Muhammad Khalifa, Yogarshi Vyas, Shuai Wang, Graham Horwood, Sunil Mallya, Miguel Ballesteros

Figure 1 for Contrastive Training Improves Zero-Shot Classification of Semi-structured Documents

Figure 2 for Contrastive Training Improves Zero-Shot Classification of Semi-structured Documents

Figure 3 for Contrastive Training Improves Zero-Shot Classification of Semi-structured Documents

Figure 4 for Contrastive Training Improves Zero-Shot Classification of Semi-structured Documents

Abstract:We investigate semi-structured document classification in a zero-shot setting. Classification of semi-structured documents is more challenging than that of standard unstructured documents, as positional, layout, and style information play a vital role in interpreting such documents. The standard classification setting where categories are fixed during both training and testing falls short in dynamic environments where new document categories could potentially emerge. We focus exclusively on the zero-shot setting where inference is done on new unseen classes. To address this task, we propose a matching-based approach that relies on a pairwise contrastive objective for both pretraining and fine-tuning. Our results show a significant boost in Macro F$_1$ from the proposed pretraining step in both supervised and unsupervised zero-shot settings.

Via

Access Paper or Ask Questions

Exploring the Role of Task Transferability in Large-Scale Multi-Task Learning

Apr 23, 2022

Vishakh Padmakumar, Leonard Lausen, Miguel Ballesteros, Sheng Zha, He He, George Karypis

Figure 1 for Exploring the Role of Task Transferability in Large-Scale Multi-Task Learning

Figure 2 for Exploring the Role of Task Transferability in Large-Scale Multi-Task Learning

Figure 3 for Exploring the Role of Task Transferability in Large-Scale Multi-Task Learning

Figure 4 for Exploring the Role of Task Transferability in Large-Scale Multi-Task Learning

Abstract:Recent work has found that multi-task training with a large number of diverse tasks can uniformly improve downstream performance on unseen target tasks. In contrast, literature on task transferability has established that the choice of intermediate tasks can heavily affect downstream task performance. In this work, we aim to disentangle the effect of scale and relatedness of tasks in multi-task representation learning. We find that, on average, increasing the scale of multi-task learning, in terms of the number of tasks, indeed results in better learned representations than smaller multi-task setups. However, if the target tasks are known ahead of time, then training on a smaller set of related tasks is competitive to the large-scale multi-task training at a reduced computational cost.

* Accepted to appear at NAACL 2022

Via

Access Paper or Ask Questions

Label Semantics for Few Shot Named Entity Recognition

Mar 16, 2022

Jie Ma, Miguel Ballesteros, Srikanth Doss, Rishita Anubhai, Sunil Mallya, Yaser Al-Onaizan, Dan Roth

Figure 1 for Label Semantics for Few Shot Named Entity Recognition

Figure 2 for Label Semantics for Few Shot Named Entity Recognition

Figure 3 for Label Semantics for Few Shot Named Entity Recognition

Figure 4 for Label Semantics for Few Shot Named Entity Recognition

Abstract:We study the problem of few shot learning for named entity recognition. Specifically, we leverage the semantic information in the names of the labels as a way of giving the model additional signal and enriched priors. We propose a neural architecture that consists of two BERT encoders, one to encode the document and its tokens and another one to encode each of the labels in natural language format. Our model learns to match the representations of named entities computed by the first encoder with label representations computed by the second encoder. The label semantics signal is shown to support improved state-of-the-art results in multiple few shot NER benchmarks and on-par performance in standard benchmarks. Our model is especially effective in low resource settings.

* Findings of ACL 2022

Via

Access Paper or Ask Questions

A Bag of Tricks for Dialogue Summarization

Sep 16, 2021

Muhammad Khalifa, Miguel Ballesteros, Kathleen McKeown

Figure 1 for A Bag of Tricks for Dialogue Summarization

Figure 2 for A Bag of Tricks for Dialogue Summarization

Figure 3 for A Bag of Tricks for Dialogue Summarization

Figure 4 for A Bag of Tricks for Dialogue Summarization

Abstract:Dialogue summarization comes with its own peculiar challenges as opposed to news or scientific articles summarization. In this work, we explore four different challenges of the task: handling and differentiating parts of the dialogue belonging to multiple speakers, negation understanding, reasoning about the situation, and informal language understanding. Using a pretrained sequence-to-sequence language model, we explore speaker name substitution, negation scope highlighting, multi-task learning with relevant tasks, and pretraining on in-domain data. Our experiments show that our proposed techniques indeed improve summarization performance, outperforming strong baselines.

* EMNLP 2021 - short paper

Via

Access Paper or Ask Questions

How much pretraining data do language models need to learn syntax?

Sep 09, 2021

Laura Pérez-Mayos, Miguel Ballesteros, Leo Wanner

Figure 1 for How much pretraining data do language models need to learn syntax?

Figure 2 for How much pretraining data do language models need to learn syntax?

Figure 3 for How much pretraining data do language models need to learn syntax?

Figure 4 for How much pretraining data do language models need to learn syntax?

Abstract:Transformers-based pretrained language models achieve outstanding results in many well-known NLU benchmarks. However, while pretraining methods are very convenient, they are expensive in terms of time and resources. This calls for a study of the impact of pretraining data size on the knowledge of the models. We explore this impact on the syntactic capabilities of RoBERTa, using models trained on incremental sizes of raw text data. First, we use syntactic structural probes to determine whether models pretrained on more data encode a higher amount of syntactic information. Second, we perform a targeted syntactic evaluation to analyze the impact of pretraining data size on the syntactic generalization performance of the models. Third, we compare the performance of the different models on three downstream applications: part-of-speech tagging, dependency parsing and paraphrase identification. We complement our study with an analysis of the cost-benefit trade-off of training such models. Our experiments show that while models pretrained on more data encode more syntactic knowledge and perform better on downstream applications, they do not always offer a better performance across the different syntactic phenomena and come at a higher financial and environmental cost.

* To be published in proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing (EMNLP 2021)

Via

Access Paper or Ask Questions

Sequential Cross-Document Coreference Resolution

Apr 17, 2021

Emily Allaway, Shuai Wang, Miguel Ballesteros

Figure 1 for Sequential Cross-Document Coreference Resolution

Figure 2 for Sequential Cross-Document Coreference Resolution

Figure 3 for Sequential Cross-Document Coreference Resolution

Figure 4 for Sequential Cross-Document Coreference Resolution

Abstract:Relating entities and events in text is a key component of natural language understanding. Cross-document coreference resolution, in particular, is important for the growing interest in multi-document analysis tasks. In this work we propose a new model that extends the efficient sequential prediction paradigm for coreference resolution to cross-document settings and achieves competitive results for both entity and event coreference while provides strong evidence of the efficacy of both sequential models and higher-order inference in cross-document settings. Our model incrementally composes mentions into cluster representations and predicts links between a mention and the already constructed clusters, approximating a higher-order model. In addition, we conduct extensive ablation studies that provide new insights into the importance of various inputs and representation types in coreference.

Via

Access Paper or Ask Questions