Title: "Why did the Model Fail?": Attributing Model Performance Changes to Distribution Shifts

Oct 19, 2022

Haoran Zhang, Harvineet Singh, Marzyeh Ghassemi, Shalmali Joshi

Abstract: Performance of machine learning models may differ between training and deployment for many reasons. For instance, model performance can change between environments due to changes in data quality, observing a different population than the one in training, or changes in the relationship between labels and features. These manifest as changes to the underlying data-generating mechanisms, and thereby result in distribution shifts across environments. Attributing performance changes to specific shifts, such as covariate or concept shifts, is critical for identifying sources of model failures, and for taking mitigating actions that ensure robust models. In this work, we introduce the problem of attributing performance differences between environments to shifts in the underlying data-generating mechanisms. We formulate the problem as a cooperative game and derive an importance weighting method for computing the value of a coalition (or a set) of distributions. The contribution of each distribution to the total performance change is then quantified as its Shapley value. We demonstrate the correctness and utility of our method on two synthetic datasets and two real-world case studies, showing its effectiveness in attributing performance changes to a wide range of distribution shifts.
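The attribution idea in the abstract can be illustrated with a small sketch. This is not the authors' implementation; it is a toy example under loud assumptions: two binary variables, two candidate mechanisms named "P(X)" and "P(Y|X)", a fixed model that predicts y_hat = x, and density ratios that are known exactly (in practice they would have to be estimated). All names (SRC, TGT, RATIOS, coalition_value, shapley_values) are hypothetical. The value of a coalition is the importance-weighted loss on source samples, with the listed mechanisms swapped to their target versions, minus the source loss; each mechanism's contribution is its Shapley value.

```python
import random
from itertools import combinations
from math import factorial

# Hypothetical toy environments (assumed for illustration, not from the paper).
SRC = {"p_x1": 0.3, "p_y1_given_x": {0: 0.1, 1: 0.9}}
TGT = {"p_x1": 0.7, "p_y1_given_x": {0: 0.2, 1: 0.6}}

def p_x(env, x):
    return env["p_x1"] if x == 1 else 1.0 - env["p_x1"]

def p_y_given_x(env, y, x):
    p1 = env["p_y1_given_x"][x]
    return p1 if y == 1 else 1.0 - p1

def sample(env, n, rng):
    """Draw n (x, y) pairs from an environment."""
    data = []
    for _ in range(n):
        x = 1 if rng.random() < env["p_x1"] else 0
        y = 1 if rng.random() < env["p_y1_given_x"][x] else 0
        data.append((x, y))
    return data

def loss(x, y):
    """0-1 loss of a fixed model that predicts y_hat = x."""
    return float(x != y)

# Target/source density ratio for each mechanism (known exactly here by assumption).
RATIOS = {
    "P(X)": lambda x, y: p_x(TGT, x) / p_x(SRC, x),
    "P(Y|X)": lambda x, y: p_y_given_x(TGT, y, x) / p_y_given_x(SRC, y, x),
}

def coalition_value(coalition, data):
    """Importance-weighted loss with the coalition's mechanisms shifted, minus source loss."""
    base = sum(loss(x, y) for x, y in data) / len(data)
    shifted = 0.0
    for x, y in data:
        w = 1.0
        for name in coalition:
            w *= RATIOS[name](x, y)
        shifted += w * loss(x, y)
    return shifted / len(data) - base

def shapley_values(players, value):
    """Exact Shapley values by enumerating all coalitions (fine for a few players)."""
    n = len(players)
    phi = {}
    for p in players:
        others = [q for q in players if q != p]
        total = 0.0
        for k in range(n):
            for subset in combinations(others, k):
                s = frozenset(subset)
                weight = factorial(k) * factorial(n - k - 1) / factorial(n)
                total += weight * (value(s | {p}) - value(s))
        phi[p] = total
    return phi

rng = random.Random(0)
data = sample(SRC, 50_000, rng)  # only source samples are needed once ratios are known
players = list(RATIOS)
phi = shapley_values(players, lambda c: coalition_value(c, data))
total_change = coalition_value(frozenset(players), data)

for name in players:
    print(f"{name}: {phi[name]:+.3f}")
print(f"total change in loss: {total_change:+.3f}")
```

For these assumed toy distributions, the population-level change in error is 0.24, of which 0.04 is attributed to "P(X)" and 0.20 to "P(Y|X)"; the sampled estimates should land near those values. By construction, the Shapley values sum to the total change, so every part of the performance difference is assigned to some mechanism.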

View paper on OpenReview
