Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Fred Morstatter

Attributing Fair Decisions with Attention Interventions

Sep 08, 2021
Ninareh Mehrabi, Umang Gupta, Fred Morstatter, Greg Ver Steeg, Aram Galstyan

Figure 1 for Attributing Fair Decisions with Attention Interventions

Figure 2 for Attributing Fair Decisions with Attention Interventions

Figure 3 for Attributing Fair Decisions with Attention Interventions

Figure 4 for Attributing Fair Decisions with Attention Interventions

The widespread use of Artificial Intelligence (AI) in consequential domains, such as healthcare and parole decision-making systems, has drawn intense scrutiny on the fairness of these methods. However, ensuring fairness is often insufficient as the rationale for a contentious decision needs to be audited, understood, and defended. We propose that the attention mechanism can be used to ensure fair outcomes while simultaneously providing feature attributions to account for how a decision was made. Toward this goal, we design an attention-based model that can be leveraged as an attribution framework. It can identify features responsible for both performance and fairness of the model through attention interventions and attention weight manipulation. Using this attribution framework, we then design a post-processing bias mitigation strategy and compare it with a suite of baselines. We demonstrate the versatility of our approach by conducting experiments on two distinct data types, tabular and textual.

Via

Access Paper or Ask Questions

Analyzing Race and Country of Citizenship Bias in Wikidata

Aug 11, 2021
Zaina Shaik, Filip Ilievski, Fred Morstatter

Figure 1 for Analyzing Race and Country of Citizenship Bias in Wikidata

Figure 2 for Analyzing Race and Country of Citizenship Bias in Wikidata

Figure 3 for Analyzing Race and Country of Citizenship Bias in Wikidata

As an open and collaborative knowledge graph created by users and bots, it is possible that the knowledge in Wikidata is biased in regards to multiple factors such as gender, race, and country of citizenship. Previous work has mostly studied the representativeness of Wikidata knowledge in terms of genders of people. In this paper, we examine the race and citizenship bias in general and in regards to STEM representation for scientists, software developers, and engineers. By comparing Wikidata queries to real-world datasets, we identify the differences in representation to characterize the biases present in Wikidata. Through this analysis, we discovered that there is an overrepresentation of white individuals and those with citizenship in Europe and North America; the rest of the groups are generally underrepresented. Based on these findings, we have found and linked to Wikidata additional data about STEM scientists from the minorities. This data is ready to be inserted into Wikidata with a bot. Increasing representation of minority race and country of citizenship groups can create a more accurate portrayal of individuals in STEM.

Via

Access Paper or Ask Questions

Lawyers are Dishonest? Quantifying Representational Harms in Commonsense Knowledge Resources

Mar 21, 2021
Ninareh Mehrabi, Pei Zhou, Fred Morstatter, Jay Pujara, Xiang Ren, Aram Galstyan

Figure 1 for Lawyers are Dishonest? Quantifying Representational Harms in Commonsense Knowledge Resources

Figure 2 for Lawyers are Dishonest? Quantifying Representational Harms in Commonsense Knowledge Resources

Figure 3 for Lawyers are Dishonest? Quantifying Representational Harms in Commonsense Knowledge Resources

Figure 4 for Lawyers are Dishonest? Quantifying Representational Harms in Commonsense Knowledge Resources

Warning: this paper contains content that may be offensive or upsetting. Numerous natural language processing models have tried injecting commonsense by using the ConceptNet knowledge base to improve performance on different tasks. ConceptNet, however, is mostly crowdsourced from humans and may reflect human biases such as "lawyers are dishonest." It is important that these biases are not conflated with the notion of commonsense. We study this missing yet important problem by first defining and quantifying biases in ConceptNet as two types of representational harms: overgeneralization of polarized perceptions and representation disparity. We find that ConceptNet contains severe biases and disparities across four demographic categories. In addition, we analyze two downstream models that use ConceptNet as a source for commonsense knowledge and find the existence of biases in those models as well. We further propose a filtered-based bias-mitigation approach and examine its effectiveness. We show that our mitigation approach can reduce the issues in both resource and models but leads to a performance drop, leaving room for future work to build fairer and stronger commonsense models.

Via

Access Paper or Ask Questions

Exacerbating Algorithmic Bias through Fairness Attacks

Dec 16, 2020
Ninareh Mehrabi, Muhammad Naveed, Fred Morstatter, Aram Galstyan

Figure 1 for Exacerbating Algorithmic Bias through Fairness Attacks

Figure 2 for Exacerbating Algorithmic Bias through Fairness Attacks

Figure 3 for Exacerbating Algorithmic Bias through Fairness Attacks

Figure 4 for Exacerbating Algorithmic Bias through Fairness Attacks

Algorithmic fairness has attracted significant attention in recent years, with many quantitative measures suggested for characterizing the fairness of different machine learning algorithms. Despite this interest, the robustness of those fairness measures with respect to an intentional adversarial attack has not been properly addressed. Indeed, most adversarial machine learning has focused on the impact of malicious attacks on the accuracy of the system, without any regard to the system's fairness. We propose new types of data poisoning attacks where an adversary intentionally targets the fairness of a system. Specifically, we propose two families of attacks that target fairness measures. In the anchoring attack, we skew the decision boundary by placing poisoned points near specific target points to bias the outcome. In the influence attack on fairness, we aim to maximize the covariance between the sensitive attributes and the decision outcome and affect the fairness of the model. We conduct extensive experiments that indicate the effectiveness of our proposed attacks.

Via

Access Paper or Ask Questions

One-shot Learning for Temporal Knowledge Graphs

Oct 23, 2020
Mehrnoosh Mirtaheri, Mohammad Rostami, Xiang Ren, Fred Morstatter, Aram Galstyan

Figure 1 for One-shot Learning for Temporal Knowledge Graphs

Figure 2 for One-shot Learning for Temporal Knowledge Graphs

Figure 3 for One-shot Learning for Temporal Knowledge Graphs

Figure 4 for One-shot Learning for Temporal Knowledge Graphs

Most real-world knowledge graphs are characterized by a long-tail relation frequency distribution where a significant fraction of relations occurs only a handful of times. This observation has given rise to recent interest in low-shot learning methods that are able to generalize from only a few examples. The existing approaches, however, are tailored to static knowledge graphs and not easily generalized to temporal settings, where data scarcity poses even bigger problems, e.g., due to occurrence of new, previously unseen relations. We address this shortcoming by proposing a one-shot learning framework for link prediction in temporal knowledge graphs. Our proposed method employs a self-attention mechanism to effectively encode temporal interactions between entities, and a network to compute a similarity score between a given query and a (one-shot) example. Our experiments show that the proposed algorithm outperforms the state of the art baselines for two well-studied benchmarks while achieving significantly better performance for sparse relations.

Via

Access Paper or Ask Questions

Leveraging Clickstream Trajectories to Reveal Low-Quality Workers in Crowdsourced Forecasting Platforms

Sep 04, 2020
Akira Matsui, Emilio Ferrara, Fred Morstatter, Andres Abeliuk, Aram Galstyan

Figure 1 for Leveraging Clickstream Trajectories to Reveal Low-Quality Workers in Crowdsourced Forecasting Platforms

Figure 2 for Leveraging Clickstream Trajectories to Reveal Low-Quality Workers in Crowdsourced Forecasting Platforms

Figure 3 for Leveraging Clickstream Trajectories to Reveal Low-Quality Workers in Crowdsourced Forecasting Platforms

Figure 4 for Leveraging Clickstream Trajectories to Reveal Low-Quality Workers in Crowdsourced Forecasting Platforms

Crowdwork often entails tackling cognitively-demanding and time-consuming tasks. Crowdsourcing can be used for complex annotation tasks, from medical imaging to geospatial data, and such data powers sensitive applications, such as health diagnostics or autonomous driving. However, the existence and prevalence of underperforming crowdworkers is well-recognized, and can pose a threat to the validity of crowdsourcing. In this study, we propose the use of a computational framework to identify clusters of underperforming workers using clickstream trajectories. We focus on crowdsourced geopolitical forecasting. The framework can reveal different types of underperformers, such as workers with forecasts whose accuracy is far from the consensus of the crowd, those who provide low-quality explanations for their forecasts, and those who simply copy-paste their forecasts from other users. Our study suggests that clickstream clustering and analysis are fundamental tools to diagnose the performance of crowdworkers in platforms leveraging the wisdom of crowds.

* 12 pages, 8 figures

Via

Access Paper or Ask Questions

Statistical Equity: A Fairness Classification Objective

May 14, 2020
Ninareh Mehrabi, Yuzhong Huang, Fred Morstatter

Figure 1 for Statistical Equity: A Fairness Classification Objective

Figure 2 for Statistical Equity: A Fairness Classification Objective

Figure 3 for Statistical Equity: A Fairness Classification Objective

Figure 4 for Statistical Equity: A Fairness Classification Objective

Machine learning systems have been shown to propagate the societal errors of the past. In light of this, a wealth of research focuses on designing solutions that are "fair." Even with this abundance of work, there is no singular definition of fairness, mainly because fairness is subjective and context dependent. We propose a new fairness definition, motivated by the principle of equity, that considers existing biases in the data and attempts to make equitable decisions that account for these previous historical biases. We formalize our definition of fairness, and motivate it with its appropriate contexts. Next, we operationalize it for equitable classification. We perform multiple automatic and human evaluations to show the effectiveness of our definition and demonstrate its utility for aspects of fairness, such as the feedback loop.

Via

Access Paper or Ask Questions

Identifying Cultural Differences through Multi-Lingual Wikipedia

Apr 10, 2020
Yufei Tian, Tuhin Chakrabarty, Fred Morstatter, Nanyun Peng

Figure 1 for Identifying Cultural Differences through Multi-Lingual Wikipedia

Figure 2 for Identifying Cultural Differences through Multi-Lingual Wikipedia

Figure 3 for Identifying Cultural Differences through Multi-Lingual Wikipedia

Figure 4 for Identifying Cultural Differences through Multi-Lingual Wikipedia

Understanding cross-cultural differences is an important application of natural language understanding. This problem is difficult due to the relativism between cultures. We present a computational approach to learn cultural models that encode the general opinions and values of cultures from multi-lingual Wikipedia. Specifically, we assume a language is a symbol of a culture and different languages represent different cultures. Our model can automatically identify statements that potentially reflect cultural differences. Experiments on English and Chinese languages show that on a held out set of diverse topics, including marriage, gun control, democracy, etc., our model achieves high correlation with human judgements regarding within-culture values and cultural differences.

Via

Access Paper or Ask Questions

Aggressive, Repetitive, Intentional, Visible, and Imbalanced: Refining Representations for Cyberbullying Classification

Apr 04, 2020
Caleb Ziems, Ymir Vigfusson, Fred Morstatter

Figure 1 for Aggressive, Repetitive, Intentional, Visible, and Imbalanced: Refining Representations for Cyberbullying Classification

Figure 2 for Aggressive, Repetitive, Intentional, Visible, and Imbalanced: Refining Representations for Cyberbullying Classification

Figure 3 for Aggressive, Repetitive, Intentional, Visible, and Imbalanced: Refining Representations for Cyberbullying Classification

Figure 4 for Aggressive, Repetitive, Intentional, Visible, and Imbalanced: Refining Representations for Cyberbullying Classification

Cyberbullying is a pervasive problem in online communities. To identify cyberbullying cases in large-scale social networks, content moderators depend on machine learning classifiers for automatic cyberbullying detection. However, existing models remain unfit for real-world applications, largely due to a shortage of publicly available training data and a lack of standard criteria for assigning ground truth labels. In this study, we address the need for reliable data using an original annotation framework. Inspired by social sciences research into bullying behavior, we characterize the nuanced problem of cyberbullying using five explicit factors to represent its social and linguistic aspects. We model this behavior using social network and language-based features, which improve classifier performance. These results demonstrate the importance of representing and modeling cyberbullying as a social phenomenon.

* 12 pages, 5 figures, 22 tables, Accepted to the 14th International AAAI Conference on Web and Social Media, ICWSM'20

Via

Access Paper or Ask Questions

Man is to Person as Woman is to Location: Measuring Gender Bias in Named Entity Recognition

Oct 24, 2019
Ninareh Mehrabi, Thamme Gowda, Fred Morstatter, Nanyun Peng, Aram Galstyan

Figure 1 for Man is to Person as Woman is to Location: Measuring Gender Bias in Named Entity Recognition

Figure 2 for Man is to Person as Woman is to Location: Measuring Gender Bias in Named Entity Recognition

Figure 3 for Man is to Person as Woman is to Location: Measuring Gender Bias in Named Entity Recognition

Figure 4 for Man is to Person as Woman is to Location: Measuring Gender Bias in Named Entity Recognition

We study the bias in several state-of-the-art named entity recognition (NER) models---specifically, a difference in the ability to recognize male and female names as PERSON entity types. We evaluate NER models on a dataset containing 139 years of U.S. census baby names and find that relatively more female names, as opposed to male names, are not recognized as PERSON entities. We study the extent of this bias in several NER systems that are used prominently in industry and academia. In addition, we also report a bias in the datasets on which these models were trained. The result of this analysis yields a new benchmark for gender bias evaluation in named entity recognition systems. The data and code for the application of this benchmark will be publicly available for researchers to use.

Via

Access Paper or Ask Questions