Explicit and implicit biases cloud human judgement, leading to discriminatory treatment of minority groups. A fundamental goal of algorithmic fairness is to avoid the pitfalls of human judgement by learning policies that improve overall outcomes while treating protected classes fairly. In this paper, we propose a causal framework that learns optimal intervention policies from data subject to fairness constraints. We define two measures of treatment bias and infer the treatment assignment that minimizes bias while optimizing the overall outcome. We demonstrate that there is an inherent tension between fairness and overall benefit; however, allowing preferential treatment of protected classes in certain circumstances (affirmative action) can dramatically improve the overall benefit while also preserving fairness. We apply our framework to data on student outcomes on standardized tests and show how it can be used to design real-world policies that fairly improve student test scores. Our framework provides a principled way to learn fair treatment policies in real-world settings.
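To make the idea of fairness-constrained policy learning concrete, here is a minimal sketch. It assumes we already have per-individual estimated treatment effects and a binary protected attribute, and it uses a simple gap between group treatment rates as the bias measure; the greedy solver, the bias definition, and all names here are illustrative stand-ins, not the paper's actual framework.

```python
# Sketch: assign a fixed budget of treatments to maximize estimated
# benefit while keeping the gap in group treatment rates below eps.
# (Hypothetical setup; not the paper's causal estimator or bias measure.)
import numpy as np

def fair_assignment(tau, protected, budget, eps=0.05):
    """tau: estimated treatment effects; protected: 0/1 group labels."""
    n = len(tau)
    treat = np.zeros(n, dtype=bool)
    order = np.argsort(-tau)  # consider most beneficial individuals first
    for i in order:
        if treat.sum() >= budget:
            break
        treat[i] = True
        # Undo the assignment if it pushes the rate gap past eps.
        rates = [treat[protected == g].mean() for g in (0, 1)]
        if abs(rates[0] - rates[1]) > eps:
            treat[i] = False
    return treat

rng = np.random.default_rng(0)
tau = rng.normal(1.0, 0.5, size=200)       # estimated effects
protected = rng.integers(0, 2, size=200)   # protected attribute
policy = fair_assignment(tau, protected, budget=50)
print(policy.sum(), "treated; total benefit =", tau[policy].sum())
```

Tightening `eps` toward zero illustrates the tension the abstract describes: the policy forgoes some high-benefit assignments to keep the groups' treatment rates close.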
To reduce human error and prejudice, many high-stakes decisions have been turned over to machine algorithms. However, recent research suggests that this does not eliminate discrimination and can even perpetuate harmful stereotypes. While algorithms have been developed to improve fairness, they typically suffer from at least one of three shortcomings: they are not interpretable, they lose significant accuracy compared to their unconstrained counterparts, or they are not transferable across models. To address these issues, we propose a geometric method that removes correlations between data and any number of protected variables. Further, we can control the strength of debiasing through an adjustable parameter, addressing the trade-off between model accuracy and fairness. The resulting features are interpretable and can be used with many popular models, such as linear regression, random forests, and multilayer perceptrons. The resulting predictions are found to be more accurate and fair than those of several comparable fair-AI algorithms across a variety of benchmark datasets. Our work shows that debiasing the data is a simple and effective route to improving fairness.
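A minimal sketch of the geometric idea follows, assuming "removing correlations" means subtracting each feature's component in the column space of the centered protected variables, with a parameter `alpha` interpolating between the raw and fully debiased features. The projection and all names are illustrative, not the paper's exact construction.

```python
# Sketch: debias features by projecting out the span of the protected
# variables; alpha=0 leaves X unchanged, alpha=1 removes all linear
# correlation with Z. (Assumed construction, for illustration only.)
import numpy as np

def debias(X, Z, alpha=1.0):
    """X: (n, d) features; Z: (n, k) protected variables."""
    Xc = X - X.mean(axis=0)
    Zc = Z - Z.mean(axis=0)
    # Least-squares fit of X on Z gives the projection onto span(Zc).
    coef, *_ = np.linalg.lstsq(Zc, Xc, rcond=None)
    return X - alpha * (Zc @ coef)

rng = np.random.default_rng(1)
Z = rng.integers(0, 2, size=(500, 1)).astype(float)
X = rng.normal(size=(500, 3)) + 2.0 * Z   # features correlated with Z
X_fair = debias(X, Z, alpha=1.0)
print(np.corrcoef(X_fair[:, 0], Z[:, 0])[0, 1])  # ~0 after debiasing
```

Because the output is just a transformed feature matrix, it can be fed to any downstream model, which is what makes this style of debiasing model-agnostic.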
Crowds can often make better decisions than individuals or small groups of experts by aggregating diverse information. Question-answering sites such as Stack Exchange rely on this "wisdom of crowds" effect to identify the best answers to questions asked by users. We analyze data from 250 communities on the Stack Exchange network to pinpoint the factors that affect which answers are chosen as best. Our results suggest that, rather than evaluating all available answers to a question, users rely on simple cognitive heuristics to choose an answer to vote for or accept. These heuristics are linked to an answer's salience, such as its position in the list and how much screen space it occupies. Askers appear to depend on heuristics more than voters do when choosing which answer to accept as the most helpful, while voters use acceptance itself as a heuristic: they are more likely to vote for an answer after it has been accepted than before. These heuristics become more important for explaining and predicting behavior as the number of available answers increases. Our findings suggest that crowd judgments may become less reliable as the number of answers grows.
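One way to test salience heuristics of this kind is to regress the choice outcome on salience features. The sketch below does this on synthetic data, using list position, answer length (as a crude screen-space proxy), and prior acceptance as hypothetical features; the data, features, and effect sizes are illustrative stand-ins for the Stack Exchange measurements, not the paper's analysis.

```python
# Sketch: fit a logistic model of "was this answer chosen?" on
# salience features. All data here is synthetic and illustrative.
import numpy as np
from sklearn.linear_model import LogisticRegression

rng = np.random.default_rng(2)
n = 2000
position = rng.integers(1, 11, size=n)    # rank in the answer list
length = rng.normal(300, 100, size=n)     # characters, screen-space proxy
accepted = rng.integers(0, 2, size=n)     # displayed with an accept mark?

# Toy assumption: choice probability rises with salience.
logit = 1.5 - 0.4 * position + 0.004 * length + 1.0 * accepted
chosen = rng.random(n) < 1 / (1 + np.exp(-logit))

Xf = np.column_stack([position, length, accepted])
model = LogisticRegression().fit(Xf, chosen)
print(dict(zip(["position", "length", "accepted"], model.coef_[0])))
```

In an analysis like the abstract describes, the fitted coefficients (and how their predictive power changes with the number of answers) would quantify how strongly each heuristic drives voting and acceptance.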