Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:Identifying Spurious Correlations for Robust Text Classification

Oct 06, 2020

Zhao Wang, Aron Culotta

Figure 1 for Identifying Spurious Correlations for Robust Text Classification

Figure 2 for Identifying Spurious Correlations for Robust Text Classification

Figure 3 for Identifying Spurious Correlations for Robust Text Classification

Figure 4 for Identifying Spurious Correlations for Robust Text Classification

Share this with someone who'll enjoy it:

Abstract:The predictions of text classifiers are often driven by spurious correlations -- e.g., the term `Spielberg' correlates with positively reviewed movies, even though the term itself does not semantically convey a positive sentiment. In this paper, we propose a method to distinguish spurious and genuine correlations in text classification. We treat this as a supervised classification problem, using features derived from treatment effect estimators to distinguish spurious correlations from "genuine" ones. Due to the generic nature of these features and their small dimensionality, we find that the approach works well even with limited training examples, and that it is possible to transport the word classifier to new domains. Experiments on four datasets (sentiment classification and toxicity detection) suggest that using this approach to inform feature selection also leads to more robust classification, as measured by improved worst-case accuracy on the samples affected by spurious correlations.

* Findings of EMNLP-2020 * Findings of EMNLP-2020

View paper on

Share this with someone who'll enjoy it:

Title:Identifying Spurious Correlations for Robust Text Classification

Paper and Code