Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Fereshte Khani

Removing Spurious Features can Hurt Accuracy and Affect Groups Disproportionately

Dec 07, 2020
Fereshte Khani, Percy Liang

Figure 1 for Removing Spurious Features can Hurt Accuracy and Affect Groups Disproportionately

Figure 2 for Removing Spurious Features can Hurt Accuracy and Affect Groups Disproportionately

Figure 3 for Removing Spurious Features can Hurt Accuracy and Affect Groups Disproportionately

Figure 4 for Removing Spurious Features can Hurt Accuracy and Affect Groups Disproportionately

The presence of spurious features interferes with the goal of obtaining robust models that perform well across many groups within the population. A natural remedy is to remove spurious features from the model. However, in this work we show that removal of spurious features can decrease accuracy due to the inductive biases of overparameterized models. We completely characterize how the removal of spurious features affects accuracy across different groups (more generally, test distributions) in noiseless overparameterized linear regression. In addition, we show that removal of spurious feature can decrease the accuracy even in balanced datasets -- each target co-occurs equally with each spurious feature; and it can inadvertently make the model more susceptible to other spurious features. Finally, we show that robust self-training can remove spurious features without affecting the overall accuracy. Experiments on the Toxic-Comment-Detectoin and CelebA datasets show that our results hold in non-linear models.

Via

Access Paper or Ask Questions

Noise Induces Loss Discrepancy Across Groups for Linear Regression

Nov 22, 2019
Fereshte Khani, Percy Liang

Figure 1 for Noise Induces Loss Discrepancy Across Groups for Linear Regression

Figure 2 for Noise Induces Loss Discrepancy Across Groups for Linear Regression

Figure 3 for Noise Induces Loss Discrepancy Across Groups for Linear Regression

Figure 4 for Noise Induces Loss Discrepancy Across Groups for Linear Regression

We study the effect of feature noise (measurement error) on the discrepancy between losses across two groups (e.g., men and women) in the context of linear regression. Our main finding is that adding even the same amount of noise on all individuals impacts groups differently. We characterize several forms of loss discrepancy in terms of the amount of noise and difference between moments of the two groups, for estimators that either do or do not use group membership information. We then study how long it takes for an estimator to adapt to a shift in the population that makes the groups have the same mean. We finally validate our results on three real-world datasets.

Via

Access Paper or Ask Questions

Maximum Weighted Loss Discrepancy

Jun 08, 2019
Fereshte Khani, Aditi Raghunathan, Percy Liang

Figure 1 for Maximum Weighted Loss Discrepancy

Figure 2 for Maximum Weighted Loss Discrepancy

Figure 3 for Maximum Weighted Loss Discrepancy

Figure 4 for Maximum Weighted Loss Discrepancy

Though machine learning algorithms excel at minimizing the average loss over a population, this might lead to large discrepancies between the losses across groups within the population. To capture this inequality, we introduce and study a notion we call maximum weighted loss discrepancy (MWLD), the maximum (weighted) difference between the loss of a group and the loss of the population. We relate MWLD to group fairness notions and robustness to demographic shifts. We then show MWLD satisfies the following three properties: 1) It is statistically impossible to estimate MWLD when all groups have equal weights. 2) For a particular family of weighting functions, we can estimate MWLD efficiently. 3) MWLD is related to loss variance, a quantity that arises in generalization bounds. We estimate MWLD with different weighting functions on four common datasets from the fairness literature. We finally show that loss variance regularization can halve the loss variance of a classifier and hence reduce MWLD without suffering a significant drop in accuracy.

* ICLR 2019 Workshop. Safe Machine Learning: Specification, Robustness, and Assurance

Via

Access Paper or Ask Questions

Planning, Inference and Pragmatics in Sequential Language Games

May 30, 2018
Fereshte Khani, Noah D. Goodman, Percy Liang

We study sequential language games in which two players, each with private information, communicate to achieve a common goal. In such games, a successful player must (i) infer the partner's private information from the partner's messages, (ii) generate messages that are most likely to help with the goal, and (iii) reason pragmatically about the partner's strategy. We propose a model that captures all three characteristics and demonstrate their importance in capturing human behavior on a new goal-oriented dataset we collected using crowdsourcing.

* In proceedings of TACL 2018

Via

Access Paper or Ask Questions

Unanimous Prediction for 100% Precision with Application to Learning Semantic Mappings

Jun 23, 2016
Fereshte Khani, Martin Rinard, Percy Liang

Figure 1 for Unanimous Prediction for 100% Precision with Application to Learning Semantic Mappings

Figure 2 for Unanimous Prediction for 100% Precision with Application to Learning Semantic Mappings

Figure 3 for Unanimous Prediction for 100% Precision with Application to Learning Semantic Mappings

Figure 4 for Unanimous Prediction for 100% Precision with Application to Learning Semantic Mappings

Can we train a system that, on any new input, either says "don't know" or makes a prediction that is guaranteed to be correct? We answer the question in the affirmative provided our model family is well-specified. Specifically, we introduce the unanimity principle: only predict when all models consistent with the training data predict the same output. We operationalize this principle for semantic parsing, the task of mapping utterances to logical forms. We develop a simple, efficient method that reasons over the infinite set of all consistent models by only checking two of the models. We prove that our method obtains 100% precision even with a modest amount of training data from a possibly adversarial distribution. Empirically, we demonstrate the effectiveness of our approach on the standard GeoQuery dataset.

* ACL 2016, Removed the duplicate author name of the previous version

Via

Access Paper or Ask Questions