Edison Marrese-Taylor

A Multi-modal Approach to Fine-grained Opinion Mining on Video Reviews

May 28, 2020
Edison Marrese-Taylor, Cristian Rodriguez-Opazo, Jorge A. Balazs, Stephen Gould, Yutaka Matsuo

Despite recent advances in opinion mining for written reviews, few works have tackled the problem on other sources of reviews. In light of this issue, we propose a multi-modal approach for mining fine-grained opinions from video reviews that is able to determine the aspects of the item under review that are being discussed and the sentiment orientation towards them. Our approach works at the sentence level without the need for time annotations and uses features derived from the audio, video and language transcriptions of its contents. We evaluate our approach on two datasets and show that leveraging the video and audio modalities consistently provides increased performance over text-only baselines, providing evidence that these extra modalities are key to better understanding video reviews.

* Second Grand Challenge and Workshop on Multimodal Language ACL 2020

Variational Inference for Learning Representations of Natural Language Edits

May 18, 2020
Edison Marrese-Taylor, Machel Reid, Yutaka Matsuo

Document editing has become a pervasive component of the production of information, with version control systems enabling edits to be efficiently stored and applied. In light of this, the task of learning distributed representations of edits has been recently proposed. With this in mind, we propose a novel approach that employs variational inference to learn a continuous latent space of vector representations to capture the underlying semantic information with regard to the document editing process. We achieve this by introducing a latent variable to explicitly model the aforementioned features. This latent variable is then combined with a document representation to guide the generation of an edited version of this document. Additionally, to facilitate standardized automatic evaluation of edit representations, which has heavily relied on direct human input thus far, we also propose a suite of downstream tasks, PEER, specifically designed to measure the quality of edit representations in the context of Natural Language Processing.

* 5th Workshop on Representation Learning for NLP (RepL4NLP-2020)

Combining Pretrained High-Resource Embeddings and Subword Representations for Low-Resource Languages

Mar 11, 2020
Machel Reid, Edison Marrese-Taylor, Yutaka Matsuo

The contrast between the need for large amounts of data for current Natural Language Processing (NLP) techniques, and the lack thereof, is accentuated in the case of African languages, most of which are considered low-resource. To help circumvent this issue, we explore techniques exploiting the qualities of morphologically rich languages (MRLs), while leveraging pretrained word vectors in well-resourced languages. In our exploration, we show that a meta-embedding approach combining both pretrained and morphologically-informed word embeddings performs best in the downstream task of Xhosa-English translation.

* To appear at ICLR (International Conference on Learning Representations) 2020 Africa NLP Workshop
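
The abstract above does not say how the pretrained and morphologically-informed embeddings are combined. As a rough illustration only, the sketch below uses one common meta-embedding baseline: L2-normalize each source vector and concatenate them. The function names and the choice of concatenation are assumptions made for this example, not the authors' method.

```python
import math
from typing import List


def l2_normalize(vector: List[float]) -> List[float]:
    """Scale a vector to unit length; return a copy unchanged if it is all zeros."""
    norm = math.sqrt(sum(x * x for x in vector))
    if norm == 0.0:
        return list(vector)
    return [x / norm for x in vector]


def concat_meta_embedding(pretrained: List[float], subword: List[float]) -> List[float]:
    """Hypothetical meta-embedding: concatenate two normalized source embeddings.

    `pretrained` stands in for a word vector from a well-resourced language,
    `subword` for a morphologically-informed vector of the same word.
    Normalizing first keeps either source from dominating by scale alone.
    """
    return l2_normalize(pretrained) + l2_normalize(subword)


# Example: a 2-d pretrained vector and a 2-d subword vector give a 4-d meta-embedding.
meta = concat_meta_embedding([3.0, 4.0], [0.0, 2.0])
```

Concatenation keeps the information from both sources at the cost of a larger vector; averaging or a learned projection are alternatives that a real system might prefer.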

An Edit-centric Approach for Wikipedia Article Quality Assessment

Sep 19, 2019
Edison Marrese-Taylor, Pablo Loyola, Yutaka Matsuo

We propose an edit-centric approach to assess Wikipedia article quality as a complementary alternative to current full document-based techniques. Our model consists of a main classifier equipped with an auxiliary generative module which, for a given edit, jointly provides an estimation of its quality and generates a description in natural language. We performed an empirical study to assess the feasibility of the proposed model and its cost-effectiveness in terms of data and quality requirements.

* Accepted at the W-NUT Workshop, EMNLP 2019

Proposal-free Temporal Moment Localization of a Natural-Language Query in Video using Guided Attention

Aug 20, 2019
Cristian Rodriguez Opazo, Edison Marrese-Taylor, Fatemeh Sadat Saleh, Hongdong Li, Stephen Gould

This paper studies the problem of temporal moment localization in a long untrimmed video using natural language as the query. Given an untrimmed video and a query sentence, the goal is to determine the start and end of the visual moment in the video that corresponds to the query. While previous works have tackled this task with a propose-and-rank approach, we introduce a more efficient, end-to-end trainable, proposal-free approach that relies on three key components: a dynamic filter to transfer language information to the visual domain, a new loss function to guide our model to attend to the most relevant parts of the video, and soft labels to model annotation uncertainty. We evaluate our method on two benchmark datasets, Charades-STA and ActivityNet-Captions. Experimental results show that our approach outperforms state-of-the-art methods on both datasets.
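
To make the idea of soft labels for annotation uncertainty more concrete, here is a minimal sketch. It assumes that an annotated boundary (start or end) is given as an index over discrete video time steps and that uncertainty is modeled with a Gaussian-shaped distribution centered on that index. The function name, the Gaussian form and the `sigma` parameter are assumptions for illustration; the abstract does not specify them.

```python
import math
from typing import List


def soft_label(num_steps: int, target_index: int, sigma: float = 1.0) -> List[float]:
    """Return a probability distribution over time steps peaked at target_index.

    Instead of a one-hot target for the annotated boundary, neighbouring
    time steps receive some probability mass, reflecting that annotators
    rarely agree on the exact boundary position.
    """
    if not 0 <= target_index < num_steps:
        raise ValueError("target_index must lie inside the video")
    if sigma <= 0:
        raise ValueError("sigma must be positive")
    weights = [
        math.exp(-((t - target_index) ** 2) / (2.0 * sigma ** 2))
        for t in range(num_steps)
    ]
    total = sum(weights)
    return [w / total for w in weights]


# Soft target for a start boundary annotated at step 2 of a 5-step video.
start_target = soft_label(num_steps=5, target_index=2, sigma=1.0)
```

A larger `sigma` spreads the mass further from the annotated step; as `sigma` shrinks, the distribution approaches a one-hot label.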

Deep contextualized word representations for detecting sarcasm and irony

Sep 26, 2018
Suzana Ilić, Edison Marrese-Taylor, Jorge A. Balazs, Yutaka Matsuo

Predicting context-dependent and non-literal utterances like sarcastic and ironic expressions remains a challenging task in NLP, as it goes beyond linguistic patterns, encompassing common sense and shared knowledge as crucial components. To capture complex morpho-syntactic features that can usually serve as indicators for irony or sarcasm across dynamic contexts, we propose a model that uses character-level vector representations of words, based on ELMo. We test our model on 7 different datasets derived from 3 different data sources, providing state-of-the-art performance in 6 of them, and otherwise offering competitive results.

* To appear in WASSA 2018

IIIDYT at IEST 2018: Implicit Emotion Classification With Deep Contextualized Word Representations

Sep 01, 2018
Jorge A. Balazs, Edison Marrese-Taylor, Yutaka Matsuo

In this paper we describe our system designed for the WASSA 2018 Implicit Emotion Shared Task (IEST), which obtained 2nd place out of 26 teams with a test macro F1 score of 0.710. The system is composed of a single pre-trained ELMo layer for encoding words, a Bidirectional Long Short-Term Memory network (BiLSTM) for enriching word representations with context, a max-pooling operation for creating sentence representations from said word vectors, and a dense layer for projecting the sentence representations into label space. Our official submission was obtained by ensembling 6 of these models initialized with different random seeds. The code for replicating this paper is available at https://github.com/jabalazs/implicit_emotion.

* Accepted as a system description paper for the Implicit Emotion Shared Task of WASSA 2018 (EMNLP)
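
Two steps of the pipeline described above are easy to show in isolation: max-pooling over word vectors to get a sentence vector, and combining the predictions of several models. The sketch below is not taken from the linked repository. It uses plain Python lists instead of tensors, and it assumes the ensemble averages per-class probabilities, which the abstract does not state. All function names are hypothetical.

```python
from typing import List, Sequence


def max_pool_over_time(word_vectors: Sequence[Sequence[float]]) -> List[float]:
    """Element-wise maximum across words, giving one fixed-size sentence vector."""
    if not word_vectors:
        raise ValueError("need at least one word vector")
    return [max(dimension) for dimension in zip(*word_vectors)]


def ensemble_average(model_probs: Sequence[Sequence[float]]) -> List[float]:
    """Average the class probabilities predicted by several models (assumed scheme)."""
    if not model_probs:
        raise ValueError("need predictions from at least one model")
    count = len(model_probs)
    return [sum(per_class) / count for per_class in zip(*model_probs)]


def argmax(values: Sequence[float]) -> int:
    """Index of the largest value, i.e. the predicted label id."""
    return max(range(len(values)), key=values.__getitem__)


# Three contextualized word vectors of size 2 pooled into one sentence vector.
sentence_vector = max_pool_over_time([[1.0, -2.0], [0.5, 3.0], [-1.0, 0.0]])

# Two models (for example, two random seeds) voting over three emotion classes.
averaged = ensemble_average([[0.25, 0.5, 0.25], [0.25, 0.25, 0.5]])
predicted_label = argmax(averaged)
```

Max-pooling yields a vector whose size does not depend on sentence length, which is what lets a single dense layer map it to the label space.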

Learning to Automatically Generate Fill-In-The-Blank Quizzes

Jun 12, 2018
Edison Marrese-Taylor, Ai Nakajima, Yutaka Matsuo, Ono Yuichi

In this paper we formalize the problem of automatic fill-in-the-blank question generation using two standard NLP machine learning schemes, proposing concrete deep learning models for each. We present an empirical study based on data obtained from a language learning platform, showing that both of our proposed settings offer promising results.

* 5th Workshop on Natural Language Processing Techniques for Educational Applications (NLPTEA), co-located with ACL 2018
* 5 pages

IIIDYT at SemEval-2018 Task 3: Irony detection in English tweets

Apr 22, 2018
Edison Marrese-Taylor, Suzana Ilic, Jorge A. Balazs, Yutaka Matsuo, Helmut Prendinger

In this paper we introduce our system for the task of irony detection in English tweets, a part of SemEval 2018. We propose a representation learning approach that relies on a multi-layered bidirectional LSTM, without using external features that provide additional semantic information. Although our model is able to outperform the baseline on the validation set, our results show limited generalization power on the test set. Given the limited size of the dataset, we think the usage of more pre-training schemes would greatly improve the obtained results.

* 4 pages
