Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Kristina Lerman

Speaker Turn Modeling for Dialogue Act Classification

Sep 10, 2021
Zihao He, Leili Tavabi, Kristina Lerman, Mohammad Soleymani

Figure 1 for Speaker Turn Modeling for Dialogue Act Classification

Figure 2 for Speaker Turn Modeling for Dialogue Act Classification

Figure 3 for Speaker Turn Modeling for Dialogue Act Classification

Figure 4 for Speaker Turn Modeling for Dialogue Act Classification

Dialogue Act (DA) classification is the task of classifying utterances with respect to the function they serve in a dialogue. Existing approaches to DA classification model utterances without incorporating the turn changes among speakers throughout the dialogue, therefore treating it no different than non-interactive written text. In this paper, we propose to integrate the turn changes in conversations among speakers when modeling DAs. Specifically, we learn conversation-invariant speaker turn embeddings to represent the speaker turns in a conversation; the learned speaker turn embeddings are then merged with the utterance embeddings for the downstream task of DA classification. With this simple yet effective mechanism, our model is able to capture the semantics from the dialogue content while accounting for different speaker turns in a conversation. Validation on three benchmark public datasets demonstrates superior performance of our model.

Via

Access Paper or Ask Questions

DoGR: Disaggregated Gaussian Regression for Reproducible Analysis of Heterogeneous Data

Aug 31, 2021
Nazanin Alipourfard, Keith Burghardt, Kristina Lerman

Figure 1 for DoGR: Disaggregated Gaussian Regression for Reproducible Analysis of Heterogeneous Data

Figure 2 for DoGR: Disaggregated Gaussian Regression for Reproducible Analysis of Heterogeneous Data

Figure 3 for DoGR: Disaggregated Gaussian Regression for Reproducible Analysis of Heterogeneous Data

Figure 4 for DoGR: Disaggregated Gaussian Regression for Reproducible Analysis of Heterogeneous Data

Quantitative analysis of large-scale data is often complicated by the presence of diverse subgroups, which reduce the accuracy of inferences they make on held-out data. To address the challenge of heterogeneous data analysis, we introduce DoGR, a method that discovers latent confounders by simultaneously partitioning the data into overlapping clusters (disaggregation) and modeling the behavior within them (regression). When applied to real-world data, our method discovers meaningful clusters and their characteristic behaviors, thus giving insight into group differences and their impact on the outcome of interest. By accounting for latent confounders, our framework facilitates exploratory analysis of noisy, heterogeneous data and can be used to learn predictive models that better generalize to new data. We provide the code to enable others to use DoGR within their data analytic workflows.

Via

Access Paper or Ask Questions

Pattern Discovery in Time Series with Byte Pair Encoding

May 30, 2021
Nazgol Tavabi, Kristina Lerman

Figure 1 for Pattern Discovery in Time Series with Byte Pair Encoding

Figure 2 for Pattern Discovery in Time Series with Byte Pair Encoding

Figure 3 for Pattern Discovery in Time Series with Byte Pair Encoding

Figure 4 for Pattern Discovery in Time Series with Byte Pair Encoding

The growing popularity of wearable sensors has generated large quantities of temporal physiological and activity data. Ability to analyze this data offers new opportunities for real-time health monitoring and forecasting. However, temporal physiological data presents many analytic challenges: the data is noisy, contains many missing values, and each series has a different length. Most methods proposed for time series analysis and classification do not handle datasets with these characteristics nor do they offer interpretability and explainability, a critical requirement in the health domain. We propose an unsupervised method for learning representations of time series based on common patterns identified within them. The patterns are, interpretable, variable in length, and extracted using Byte Pair Encoding compression technique. In this way the method can capture both long-term and short-term dependencies present in the data. We show that this method applies to both univariate and multivariate time series and beats state-of-the-art approaches on a real world dataset collected from wearable sensors.

Via

Access Paper or Ask Questions

Detecting Polarized Topics in COVID-19 News Using Partisanship-aware Contextualized Topic Embeddings

Apr 15, 2021
Zihao He, Negar Mokhberian, Antonio Camara, Andres Abeliuk, Kristina Lerman

Figure 1 for Detecting Polarized Topics in COVID-19 News Using Partisanship-aware Contextualized Topic Embeddings

Figure 2 for Detecting Polarized Topics in COVID-19 News Using Partisanship-aware Contextualized Topic Embeddings

Figure 3 for Detecting Polarized Topics in COVID-19 News Using Partisanship-aware Contextualized Topic Embeddings

Figure 4 for Detecting Polarized Topics in COVID-19 News Using Partisanship-aware Contextualized Topic Embeddings

Growing polarization of the news media has been blamed for fanning disagreement, controversy and even violence. Early identification of polarized topics is thus an urgent matter that can help mitigate conflict. However, accurate measurement of polarization is still an open research challenge. To address this gap, we propose Partisanship-aware Contextualized Topic Embeddings (PaCTE), a method to automatically detect polarized topics from partisan news sources. Specifically, we represent the ideology of a news source on a topic by corpus-contextualized topic embedding utilizing a language model that has been finetuned on recognizing partisanship of the news articles, and measure the polarization between sources using cosine similarity. We apply our method to a corpus of news about COVID-19 pandemic. Extensive experiments on different news sources and topics demonstrate the effectiveness of our method to precisely capture the topical polarization and alignment between different news sources. To help clarify and validate results, we explain the polarization using the Moral Foundation Theory.

Via

Access Paper or Ask Questions

Individualized Context-Aware Tensor Factorization for Online Games Predictions

Feb 22, 2021
Julie Jiang, Kristina Lerman, Emilio Ferrara

Figure 1 for Individualized Context-Aware Tensor Factorization for Online Games Predictions

Figure 2 for Individualized Context-Aware Tensor Factorization for Online Games Predictions

Figure 3 for Individualized Context-Aware Tensor Factorization for Online Games Predictions

Figure 4 for Individualized Context-Aware Tensor Factorization for Online Games Predictions

Individual behavior and decisions are substantially influenced by their contexts, such as location, environment, and time. Changes along these dimensions can be readily observed in Multiplayer Online Battle Arena games (MOBA), where players face different in-game settings for each match and are subject to frequent game patches. Existing methods utilizing contextual information generalize the effect of a context over the entire population, but contextual information tailored to each individual can be more effective. To achieve this, we present the Neural Individualized Context-aware Embeddings (NICE) model for predicting user performance and game outcomes. Our proposed method identifies individual behavioral differences in different contexts by learning latent representations of users and contexts through non-negative tensor factorization. Using a dataset from the MOBA game League of Legends, we demonstrate that our model substantially improves the prediction of winning outcome, individual user performance, and user engagement.

* 2020 International Conference on Data Mining Workshops (ICDMW)

Via

Access Paper or Ask Questions

Inherent Trade-offs in the Fair Allocation of Treatments

Oct 30, 2020
Yuzi He, Keith Burghardt, Siyi Guo, Kristina Lerman

Figure 1 for Inherent Trade-offs in the Fair Allocation of Treatments

Figure 2 for Inherent Trade-offs in the Fair Allocation of Treatments

Figure 3 for Inherent Trade-offs in the Fair Allocation of Treatments

Figure 4 for Inherent Trade-offs in the Fair Allocation of Treatments

Explicit and implicit bias clouds human judgement, leading to discriminatory treatment of minority groups. A fundamental goal of algorithmic fairness is to avoid the pitfalls in human judgement by learning policies that improve the overall outcomes while providing fair treatment to protected classes. In this paper, we propose a causal framework that learns optimal intervention policies from data subject to fairness constraints. We define two measures of treatment bias and infer best treatment assignment that minimizes the bias while optimizing overall outcome. We demonstrate that there is a dilemma of balancing fairness and overall benefit; however, allowing preferential treatment to protected classes in certain circumstances (affirmative action) can dramatically improve the overall benefit while also preserving fairness. We apply our framework to data containing student outcomes on standardized tests and show how it can be used to design real-world policies that fairly improve student test scores. Our framework provides a principled way to learn fair treatment policies in real-world settings.

Via

Access Paper or Ask Questions

Challenges in Forecasting Malicious Events from Incomplete Data

Apr 06, 2020
Nazgol Tavabi, Andrés Abeliuk, Negar Mokhberian, Jeremy Abramson, Kristina Lerman

Figure 1 for Challenges in Forecasting Malicious Events from Incomplete Data

Figure 2 for Challenges in Forecasting Malicious Events from Incomplete Data

Figure 3 for Challenges in Forecasting Malicious Events from Incomplete Data

Figure 4 for Challenges in Forecasting Malicious Events from Incomplete Data

The ability to accurately predict cyber-attacks would enable organizations to mitigate their growing threat and avert the financial losses and disruptions they cause. But how predictable are cyber-attacks? Researchers have attempted to combine external data -- ranging from vulnerability disclosures to discussions on Twitter and the darkweb -- with machine learning algorithms to learn indicators of impending cyber-attacks. However, successful cyber-attacks represent a tiny fraction of all attempted attacks: the vast majority are stopped, or filtered by the security appliances deployed at the target. As we show in this paper, the process of filtering reduces the predictability of cyber-attacks. The small number of attacks that do penetrate the target's defenses follow a different generative process compared to the whole data which is much harder to learn for predictive models. This could be caused by the fact that the resulting time series also depends on the filtering process in addition to all the different factors that the original time series depended on. We empirically quantify the loss of predictability due to filtering using real-world data from two organizations. Our work identifies the limits to forecasting cyber-attacks from highly filtered data.

* Accepted in The Fifth Workshop on Computational Methods in Online Misbehavior, Companion Proceedings of The 2020 World Wide Web Conference (WWW '20)

Via

Access Paper or Ask Questions

Learning Behavioral Representations from Wearable Sensors

Nov 16, 2019
Nazgol Tavabi, Homa Hosseinmardi, Jennifer L. Villatte, Andrés Abeliuk, Shrikanth Narayanan, Emilio Ferrara, Kristina Lerman

Figure 1 for Learning Behavioral Representations from Wearable Sensors

Figure 2 for Learning Behavioral Representations from Wearable Sensors

Figure 3 for Learning Behavioral Representations from Wearable Sensors

Figure 4 for Learning Behavioral Representations from Wearable Sensors

The ubiquity of mobile devices and wearable sensors offers unprecedented opportunities for continuous collection of multimodal physiological data. Such data enables temporal characterization of an individual's behaviors, which can provide unique insights into her physical and psychological health. Understanding the relation between different behaviors/activities and personality traits such as stress or work performance can help build strategies to improve the work environment. Especially in workplaces like hospitals where many employees are overworked, having such policies improves the quality of patient care by prioritizing mental and physical health of their caregivers. One challenge in analyzing physiological data is extracting the underlying behavioral states from the temporal sensor signals and interpreting them. Here, we use a non-parametric Bayesian approach, to model multivariate sensor data from multiple people and discover dynamic behaviors they share. We apply this method to data collected from sensors worn by a population of workers in a large urban hospital, capturing their physiological signals, such as breathing and heart rate, and activity patterns. We show that the learned states capture behavioral differences within the population that can help cluster participants into meaningful groups and better predict their cognitive and affective states. This method offers a practical way to learn compact behavioral representations from dynamic multivariate sensor signals and provide insights into the data.

Via

Access Paper or Ask Questions

Learning Fair and Interpretable Representations via Linear Orthogonalization

Oct 28, 2019
Yuzi He, Keith Burghardt, Kristina Lerman

Figure 1 for Learning Fair and Interpretable Representations via Linear Orthogonalization

Figure 2 for Learning Fair and Interpretable Representations via Linear Orthogonalization

Figure 3 for Learning Fair and Interpretable Representations via Linear Orthogonalization

Figure 4 for Learning Fair and Interpretable Representations via Linear Orthogonalization

To reduce human error and prejudice, many high-stakes decisions have been turned over to machine algorithms. However, recent research suggests that this does not remove discrimination, and can perpetuate harmful stereotypes. While algorithms have been developed to improve fairness, they typically face at least one of three shortcomings: they are not interpretable, they lose significant accuracy compared to unbiased equivalents, or they are not transferable across models. To address these issues, we propose a geometric method that removes correlations between data and any number of protected variables. Further, we can control the strength of debiasing through an adjustable parameter to address the trade-off between model accuracy and fairness. The resulting features are interpretable and can be used with many popular models, such as linear regression, random forest and multilayer perceptrons. The resulting predictions are found to be more accurate and fair than several comparable fair AI algorithms across a variety of benchmark datasets. Our work shows that debiasing data is a simple and effective solution toward improving fairness.

* 9 pages, 5 figures

Via

Access Paper or Ask Questions

A Survey on Bias and Fairness in Machine Learning

Sep 17, 2019
Ninareh Mehrabi, Fred Morstatter, Nripsuta Saxena, Kristina Lerman, Aram Galstyan

Figure 1 for A Survey on Bias and Fairness in Machine Learning

Figure 2 for A Survey on Bias and Fairness in Machine Learning

Figure 3 for A Survey on Bias and Fairness in Machine Learning

Figure 4 for A Survey on Bias and Fairness in Machine Learning

With the widespread use of AI systems and applications in our everyday lives, it is important to take fairness issues into consideration while designing and engineering these types of systems. Such systems can be used in many sensitive environments to make important and life-changing decisions; thus, it is crucial to ensure that the decisions do not reflect discriminatory behavior toward certain groups or populations. We have recently seen work in machine learning, natural language processing, and deep learning that addresses such challenges in different subdomains. With the commercialization of these systems, researchers are becoming aware of the biases that these applications can contain and have attempted to address them. In this survey we investigated different real-world applications that have shown biases in various ways, and we listed different sources of biases that can affect AI applications. We then created a taxonomy for fairness definitions that machine learning researchers have defined in order to avoid the existing bias in AI systems. In addition to that, we examined different domains and subdomains in AI showing what researchers have observed with regard to unfair outcomes in the state-of-the-art methods and how they have tried to address them. There are still many future directions and solutions that can be taken to mitigate the problem of bias in AI systems. We are hoping that this survey will motivate researchers to tackle these issues in the near future by observing existing work in their respective fields.

Via

Access Paper or Ask Questions