Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Yangfeng Ji

Pointwise Paraphrase Appraisal is Potentially Problematic

Jun 05, 2020

Hannah Chen, Yangfeng Ji, David Evans

Figure 1 for Pointwise Paraphrase Appraisal is Potentially Problematic

Figure 2 for Pointwise Paraphrase Appraisal is Potentially Problematic

Figure 3 for Pointwise Paraphrase Appraisal is Potentially Problematic

Figure 4 for Pointwise Paraphrase Appraisal is Potentially Problematic

Abstract:The prevailing approach for training and evaluating paraphrase identification models is constructed as a binary classification problem: the model is given a pair of sentences, and is judged by how accurately it classifies pairs as either paraphrases or non-paraphrases. This pointwise-based evaluation method does not match well the objective of most real world applications, so the goal of our work is to understand how models which perform well under pointwise evaluation may fail in practice and find better methods for evaluating paraphrase identification models. As a first step towards that goal, we show that although the standard way of fine-tuning BERT for paraphrase identification by pairing two sentences as one sequence results in a model with state-of-the-art performance, that model may perform poorly on simple tasks like identifying pairs with two identical sentences. Moreover, we show that these models may even predict a pair of randomly-selected sentences with higher paraphrase score than a pair of identical ones.

* ACL 2020 Student Research Workshop

Via

Access Paper or Ask Questions

Reevaluating Adversarial Examples in Natural Language

Apr 25, 2020

John X. Morris, Eli Lifland, Jack Lanchantin, Yangfeng Ji, Yanjun Qi

Figure 1 for Reevaluating Adversarial Examples in Natural Language

Figure 2 for Reevaluating Adversarial Examples in Natural Language

Figure 3 for Reevaluating Adversarial Examples in Natural Language

Figure 4 for Reevaluating Adversarial Examples in Natural Language

Abstract:State-of-the-art attacks on NLP models have different definitions of what constitutes a successful attack. These differences make the attacks difficult to compare. We propose to standardize definitions of natural language adversarial examples based on a set of linguistic constraints: semantics, grammaticality, edit distance, and non-suspicion. We categorize previous attacks based on these constraints. For each constraint, we suggest options for human and automatic evaluation methods. We use these methods to evaluate two state-of-the-art synonym substitution attacks. We find that perturbations often do not preserve semantics, and 45\% introduce grammatical errors. Next, we conduct human studies to find a threshold for each evaluation method that aligns with human judgment. Human surveys reveal that to truly preserve semantics, we need to significantly increase the minimum cosine similarity between the embeddings of swapped words and sentence encodings of original and perturbed inputs. After tightening these constraints to agree with the judgment of our human annotators, the attacks produce valid, successful adversarial examples. But quality comes at a cost: attack success rate drops by over 70 percentage points. Finally, we introduce TextAttack, a library for adversarial attacks in NLP.

* 14 pages; 10 Tables; 4 Figures

Via

Access Paper or Ask Questions

Generating Hierarchical Explanations on Text Classification via Feature Interaction Detection

Apr 04, 2020

Hanjie Chen, Guangtao Zheng, Yangfeng Ji

Figure 1 for Generating Hierarchical Explanations on Text Classification via Feature Interaction Detection

Figure 2 for Generating Hierarchical Explanations on Text Classification via Feature Interaction Detection

Figure 3 for Generating Hierarchical Explanations on Text Classification via Feature Interaction Detection

Figure 4 for Generating Hierarchical Explanations on Text Classification via Feature Interaction Detection

Abstract:Generating explanations for neural networks has become crucial for their applications in real-world with respect to reliability and trustworthiness. In natural language processing, existing methods usually provide important features which are words or phrases selected from an input text as an explanation, but ignore the interactions between them. It poses challenges for humans to interpret an explanation and connect it to model prediction. In this work, we build hierarchical explanations by detecting feature interactions. Such explanations visualize how words and phrases are combined at different levels of the hierarchy, which can help users understand the decision-making of black-box models. The proposed method is evaluated with three neural text classifiers (LSTM, CNN, and BERT) on two benchmark datasets, via both automatic and human evaluations. Experiments show the effectiveness of the proposed method in providing explanations that are both faithful to models and interpretable to humans.

* Accepted to ACL 2020

Via

Access Paper or Ask Questions

Improving the Explainability of Neural Sentiment Classifiers via Data Augmentation

Oct 09, 2019

Hanjie Chen, Yangfeng Ji

Figure 1 for Improving the Explainability of Neural Sentiment Classifiers via Data Augmentation

Figure 2 for Improving the Explainability of Neural Sentiment Classifiers via Data Augmentation

Figure 3 for Improving the Explainability of Neural Sentiment Classifiers via Data Augmentation

Figure 4 for Improving the Explainability of Neural Sentiment Classifiers via Data Augmentation

Abstract:Sentiment analysis has been widely used by businesses for social media opinion mining, especially in the financial services industry, where customers' feedbacks are critical for companies. Recent progress of neural network models has achieved remarkable performance on sentiment classification, while the lack of classification interpretation may raise the trustworthy and many other issues in practice. In this work, we study the problem of improving the explainability of existing sentiment classifiers. We propose two data augmentation methods that create additional training examples to help improve model explainability: one method with a predefined sentiment word list as external knowledge and the other with adversarial examples. We test the proposed methods on both CNN and RNN classifiers with three benchmark sentiment datasets. The model explainability is assessed by both human evaluators and a simple automatic evaluation measurement. Experiments show the proposed data augmentation methods significantly improve the explainability of both neural classifiers.

* 11 pages, NeurIPS 2019 Workshop on Robust AI in Financial Services

Via

Access Paper or Ask Questions

An Empirical Comparison on Imitation Learning and Reinforcement Learning for Paraphrase Generation

Aug 28, 2019

Wanyu Du, Yangfeng Ji

Figure 1 for An Empirical Comparison on Imitation Learning and Reinforcement Learning for Paraphrase Generation

Figure 2 for An Empirical Comparison on Imitation Learning and Reinforcement Learning for Paraphrase Generation

Figure 3 for An Empirical Comparison on Imitation Learning and Reinforcement Learning for Paraphrase Generation

Figure 4 for An Empirical Comparison on Imitation Learning and Reinforcement Learning for Paraphrase Generation

Abstract:Generating paraphrases from given sentences involves decoding words step by step from a large vocabulary. To learn a decoder, supervised learning which maximizes the likelihood of tokens always suffers from the exposure bias. Although both reinforcement learning (RL) and imitation learning (IL) have been widely used to alleviate the bias, the lack of direct comparison leads to only a partial image on their benefits. In this work, we present an empirical study on how RL and IL can help boost the performance of generating paraphrases, with the pointer-generator as a base model. Experiments on the benchmark datasets show that (1) imitation learning is constantly better than reinforcement learning; and (2) the pointer-generator models with imitation learning outperform the state-of-the-art methods with a large margin.

* 9 pages, 2 figures, EMNLP2019

Via

Access Paper or Ask Questions

Dynamic Entity Representations in Neural Language Models

Aug 02, 2017

Yangfeng Ji, Chenhao Tan, Sebastian Martschat, Yejin Choi, Noah A. Smith

Figure 1 for Dynamic Entity Representations in Neural Language Models

Figure 2 for Dynamic Entity Representations in Neural Language Models

Figure 3 for Dynamic Entity Representations in Neural Language Models

Figure 4 for Dynamic Entity Representations in Neural Language Models

Abstract:Understanding a long document requires tracking how entities are introduced and evolve over time. We present a new type of language model, EntityNLM, that can explicitly model entities, dynamically update their representations, and contextually generate their mentions. Our model is generative and flexible; it can model an arbitrary number of entities in context while generating each entity mention at an arbitrary length. In addition, it can be used for several different tasks such as language modeling, coreference resolution, and entity prediction. Experimental results with all these tasks demonstrate that our model consistently outperforms strong baselines and prior work.

* EMNLP 2017 camera-ready version

Via

Access Paper or Ask Questions

Neural Discourse Structure for Text Categorization

May 06, 2017

Yangfeng Ji, Noah Smith

Figure 1 for Neural Discourse Structure for Text Categorization

Figure 2 for Neural Discourse Structure for Text Categorization

Figure 3 for Neural Discourse Structure for Text Categorization

Figure 4 for Neural Discourse Structure for Text Categorization

Abstract:We show that discourse structure, as defined by Rhetorical Structure Theory and provided by an existing discourse parser, benefits text categorization. Our approach uses a recursive neural network and a newly proposed attention mechanism to compute a representation of the text that focuses on salient content, from the perspective of both RST and the task. Experiments consider variants of the approach and illustrate its strengths and weaknesses.

* ACL 2017 camera ready version

Via

Access Paper or Ask Questions

DyNet: The Dynamic Neural Network Toolkit

Jan 15, 2017

Graham Neubig, Chris Dyer, Yoav Goldberg, Austin Matthews, Waleed Ammar, Antonios Anastasopoulos, Miguel Ballesteros, David Chiang, Daniel Clothiaux, Trevor Cohn(+15 more)

Figure 1 for DyNet: The Dynamic Neural Network Toolkit

Figure 2 for DyNet: The Dynamic Neural Network Toolkit

Figure 3 for DyNet: The Dynamic Neural Network Toolkit

Figure 4 for DyNet: The Dynamic Neural Network Toolkit

Abstract:We describe DyNet, a toolkit for implementing neural network models based on dynamic declaration of network structure. In the static declaration strategy that is used in toolkits like Theano, CNTK, and TensorFlow, the user first defines a computation graph (a symbolic representation of the computation), and then examples are fed into an engine that executes this computation and computes its derivatives. In DyNet's dynamic declaration strategy, computation graph construction is mostly transparent, being implicitly constructed by executing procedural code that computes the network outputs, and the user is free to use different network structures for each input. Dynamic declaration thus facilitates the implementation of more complicated network architectures, and DyNet is specifically designed to allow users to implement their models in a way that is idiomatic in their preferred programming language (C++ or Python). One challenge with dynamic declaration is that because the symbolic computation graph is defined anew for every training example, its construction must have low overhead. To achieve this, DyNet has an optimized C++ backend and lightweight graph representation. Experiments show that DyNet's speeds are faster than or comparable with static declaration toolkits, and significantly faster than Chainer, another dynamic declaration toolkit. DyNet is released open-source under the Apache 2.0 license and available at http://github.com/clab/dynet.

* 33 pages

Via

Access Paper or Ask Questions

A Latent Variable Recurrent Neural Network for Discourse Relation Language Models

Apr 05, 2016

Yangfeng Ji, Gholamreza Haffari, Jacob Eisenstein

Figure 1 for A Latent Variable Recurrent Neural Network for Discourse Relation Language Models

Figure 2 for A Latent Variable Recurrent Neural Network for Discourse Relation Language Models

Figure 3 for A Latent Variable Recurrent Neural Network for Discourse Relation Language Models

Figure 4 for A Latent Variable Recurrent Neural Network for Discourse Relation Language Models

Abstract:This paper presents a novel latent variable recurrent neural network architecture for jointly modeling sequences of words and (possibly latent) discourse relations between adjacent sentences. A recurrent neural network generates individual words, thus reaping the benefits of discriminatively-trained vector representations. The discourse relations are represented with a latent variable, which can be predicted or marginalized, depending on the task. The resulting model can therefore employ a training objective that includes not only discourse relation classification, but also word prediction. As a result, it outperforms state-of-the-art alternatives for two tasks: implicit discourse relation classification in the Penn Discourse Treebank, and dialog act classification in the Switchboard corpus. Furthermore, by marginalizing over latent discourse relations at test time, we obtain a discourse informed language model, which improves over a strong LSTM baseline.

* NAACL 2016 camera ready, 11 pages

Via

Access Paper or Ask Questions

LSTM based Conversation Models

Mar 31, 2016

Yi Luan, Yangfeng Ji, Mari Ostendorf

Figure 1 for LSTM based Conversation Models

Figure 2 for LSTM based Conversation Models

Figure 3 for LSTM based Conversation Models

Figure 4 for LSTM based Conversation Models

Abstract:In this paper, we present a conversational model that incorporates both context and participant role for two-party conversations. Different architectures are explored for integrating participant role and context information into a Long Short-term Memory (LSTM) language model. The conversational model can function as a language model or a language generation model. Experiments on the Ubuntu Dialog Corpus show that our model can capture multiple turn interaction between participants. The proposed method outperforms a traditional LSTM model as measured by language model perplexity and response ranking. Generated responses show characteristic differences between the two participant roles.

Via

Access Paper or Ask Questions