Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Gunnar König

Performative Validity of Recourse Explanations

Jun 18, 2025

Gunnar König, Hidde Fokkema, Timo Freiesleben, Celestine Mendler-Dünner, Ulrike Von Luxburg

Abstract:When applicants get rejected by an algorithmic decision system, recourse explanations provide actionable suggestions for how to change their input features to get a positive evaluation. A crucial yet overlooked phenomenon is that recourse explanations are performative: When many applicants act according to their recommendations, their collective behavior may change statistical regularities in the data and, once the model is refitted, also the decision boundary. Consequently, the recourse algorithm may render its own recommendations invalid, such that applicants who make the effort of implementing their recommendations may be rejected again when they reapply. In this work, we formally characterize the conditions under which recourse explanations remain valid under performativity. A key finding is that recourse actions may become invalid if they are influenced by or if they intervene on non-causal variables. Based on our analysis, we caution against the use of standard counterfactual explanations and causal recourse methods, and instead advocate for recourse methods that recommend actions exclusively on causal variables.

* 34 pages, 3 figures, 1 table, Preprint

Via

Access Paper or Ask Questions

Disentangling Interactions and Dependencies in Feature Attribution

Oct 31, 2024

Gunnar König, Eric Günther, Ulrike von Luxburg

Figure 1 for Disentangling Interactions and Dependencies in Feature Attribution

Figure 2 for Disentangling Interactions and Dependencies in Feature Attribution

Figure 3 for Disentangling Interactions and Dependencies in Feature Attribution

Figure 4 for Disentangling Interactions and Dependencies in Feature Attribution

Abstract:In explainable machine learning, global feature importance methods try to determine how much each individual feature contributes to predicting the target variable, resulting in one importance score for each feature. But often, predicting the target variable requires interactions between several features (such as in the XOR function), and features might have complex statistical dependencies that allow to partially replace one feature with another one. In commonly used feature importance scores these cooperative effects are conflated with the features' individual contributions, making them prone to misinterpretations. In this work, we derive DIP, a new mathematical decomposition of individual feature importance scores that disentangles three components: the standalone contribution and the contributions stemming from interactions and dependencies. We prove that the DIP decomposition is unique and show how it can be estimated in practice. Based on these results, we propose a new visualization of feature importance scores that clearly illustrates the different contributions.

* GK and EG contributed equally to this article

Via

Access Paper or Ask Questions

A Guide to Feature Importance Methods for Scientific Inference

Apr 19, 2024

Fiona Katharina Ewald, Ludwig Bothmann, Marvin N. Wright, Bernd Bischl, Giuseppe Casalicchio, Gunnar König

Abstract:While machine learning (ML) models are increasingly used due to their high predictive power, their use in understanding the data-generating process (DGP) is limited. Understanding the DGP requires insights into feature-target associations, which many ML models cannot directly provide, due to their opaque internal mechanisms. Feature importance (FI) methods provide useful insights into the DGP under certain conditions. Since the results of different FI methods have different interpretations, selecting the correct FI method for a concrete use case is crucial and still requires expert knowledge. This paper serves as a comprehensive guide to help understand the different interpretations of FI methods. Through an extensive review of FI methods and providing new proofs regarding their interpretation, we facilitate a thorough understanding of these methods and formulate concrete recommendations for scientific inference. We conclude by discussing options for FI uncertainty estimation and point to directions for future research aiming at full statistical inference from black-box ML models.

* Accepted at the 2nd World Conference on eXplainable Artificial Intelligence, xAI-2024

Via

Access Paper or Ask Questions

CountARFactuals -- Generating plausible model-agnostic counterfactual explanations with adversarial random forests

Apr 04, 2024

Susanne Dandl, Kristin Blesch, Timo Freiesleben, Gunnar König, Jan Kapar, Bernd Bischl, Marvin Wright

Figure 1 for CountARFactuals -- Generating plausible model-agnostic counterfactual explanations with adversarial random forests

Figure 2 for CountARFactuals -- Generating plausible model-agnostic counterfactual explanations with adversarial random forests

Figure 3 for CountARFactuals -- Generating plausible model-agnostic counterfactual explanations with adversarial random forests

Figure 4 for CountARFactuals -- Generating plausible model-agnostic counterfactual explanations with adversarial random forests

Abstract:Counterfactual explanations elucidate algorithmic decisions by pointing to scenarios that would have led to an alternative, desired outcome. Giving insight into the model's behavior, they hint users towards possible actions and give grounds for contesting decisions. As a crucial factor in achieving these goals, counterfactuals must be plausible, i.e., describing realistic alternative scenarios within the data manifold. This paper leverages a recently developed generative modeling technique -- adversarial random forests (ARFs) -- to efficiently generate plausible counterfactuals in a model-agnostic way. ARFs can serve as a plausibility measure or directly generate counterfactual explanations. Our ARF-based approach surpasses the limitations of existing methods that aim to generate plausible counterfactual explanations: It is easy to train and computationally highly efficient, handles continuous and categorical data naturally, and allows integrating additional desiderata such as sparsity in a straightforward manner.

* SD, KB, TB, and GK contributed equally as first authors

Via

Access Paper or Ask Questions

Dear XAI Community, We Need to Talk! Fundamental Misconceptions in Current XAI Research

Jun 07, 2023

Timo Freiesleben, Gunnar König

Figure 1 for Dear XAI Community, We Need to Talk! Fundamental Misconceptions in Current XAI Research

Figure 2 for Dear XAI Community, We Need to Talk! Fundamental Misconceptions in Current XAI Research

Figure 3 for Dear XAI Community, We Need to Talk! Fundamental Misconceptions in Current XAI Research

Figure 4 for Dear XAI Community, We Need to Talk! Fundamental Misconceptions in Current XAI Research

Abstract:Despite progress in the field, significant parts of current XAI research are still not on solid conceptual, ethical, or methodological grounds. Unfortunately, these unfounded parts are not on the decline but continue to grow. Many explanation techniques are still proposed without clarifying their purpose. Instead, they are advertised with ever more fancy-looking heatmaps or only seemingly relevant benchmarks. Moreover, explanation techniques are motivated with questionable goals, such as building trust, or rely on strong assumptions about the 'concepts' that deep learning algorithms learn. In this paper, we highlight and discuss these and other misconceptions in current XAI research. We also suggest steps to make XAI a more substantive area of research.

* A revised version of this preprint has been accepted at the World XAI Conference. It will be referenced as soon as it is published

Via

Access Paper or Ask Questions

Efficient SAGE Estimation via Causal Structure Learning

Apr 06, 2023

Christoph Luther, Gunnar König, Moritz Grosse-Wentrup

Figure 1 for Efficient SAGE Estimation via Causal Structure Learning

Figure 2 for Efficient SAGE Estimation via Causal Structure Learning

Figure 3 for Efficient SAGE Estimation via Causal Structure Learning

Figure 4 for Efficient SAGE Estimation via Causal Structure Learning

Abstract:The Shapley Additive Global Importance (SAGE) value is a theoretically appealing interpretability method that fairly attributes global importance to a model's features. However, its exact calculation requires the computation of the feature's surplus performance contributions over an exponential number of feature sets. This is computationally expensive, particularly because estimating the surplus contributions requires sampling from conditional distributions. Thus, SAGE approximation algorithms only take a fraction of the feature sets into account. We propose $d$-SAGE, a method that accelerates SAGE approximation. $d$-SAGE is motivated by the observation that conditional independencies (CIs) between a feature and the model target imply zero surplus contributions, such that their computation can be skipped. To identify CIs, we leverage causal structure learning (CSL) to infer a graph that encodes (conditional) independencies in the data as $d$-separations. This is computationally more efficient because the expense of the one-time graph inference and the $d$-separation queries is negligible compared to the expense of surplus contribution evaluations. Empirically we demonstrate that $d$-SAGE enables the efficient and accurate estimation of SAGE values.

* equal contribution between Luther and K\"onig; accepted at AISTATS 2023

Via

Access Paper or Ask Questions

Improvement-Focused Causal Recourse (ICR)

Oct 27, 2022

Gunnar König, Timo Freiesleben, Moritz Grosse-Wentrup

Abstract:Algorithmic recourse recommendations, such as Karimi et al.'s (2021) causal recourse (CR), inform stakeholders of how to act to revert unfavourable decisions. However, some actions lead to acceptance (i.e., revert the model's decision) but do not lead to improvement (i.e., may not revert the underlying real-world state). To recommend such actions is to recommend fooling the predictor. We introduce a novel method, Improvement-Focused Causal Recourse (ICR), which involves a conceptual shift: Firstly, we require ICR recommendations to guide towards improvement. Secondly, we do not tailor the recommendations to be accepted by a specific predictor. Instead, we leverage causal knowledge to design decision systems that predict accurately pre- and post-recourse. As a result, improvement guarantees translate into acceptance guarantees. We demonstrate that given correct causal knowledge, ICR, in contrast to existing approaches, guides towards both acceptance and improvement.

* under review

Via

Access Paper or Ask Questions

Scientific Inference With Interpretable Machine Learning: Analyzing Models to Learn About Real-World Phenomena

Jun 11, 2022

Timo Freiesleben, Gunnar König, Christoph Molnar, Alvaro Tejero-Cantero

Figure 1 for Scientific Inference With Interpretable Machine Learning: Analyzing Models to Learn About Real-World Phenomena

Figure 2 for Scientific Inference With Interpretable Machine Learning: Analyzing Models to Learn About Real-World Phenomena

Figure 3 for Scientific Inference With Interpretable Machine Learning: Analyzing Models to Learn About Real-World Phenomena

Figure 4 for Scientific Inference With Interpretable Machine Learning: Analyzing Models to Learn About Real-World Phenomena

Abstract:Interpretable machine learning (IML) is concerned with the behavior and the properties of machine learning models. Scientists, however, are only interested in the model as a gateway to understanding the modeled phenomenon. We show how to develop IML methods such that they allow insight into relevant phenomenon properties. We argue that current IML research conflates two goals of model-analysis -- model audit and scientific inference. Thereby, it remains unclear if model interpretations have corresponding phenomenon interpretation. Building on statistical decision theory, we show that ML model analysis allows to describe relevant aspects of the joint data probability distribution. We provide a five-step framework for constructing IML descriptors that can help in addressing scientific questions, including a natural way to quantify epistemic uncertainty. Our phenomenon-centric approach to IML in science clarifies: the opportunities and limitations of IML for inference; that conditional not marginal sampling is required; and, the conditions under which we can trust IML methods.

Via

Access Paper or Ask Questions

Relating the Partial Dependence Plot and Permutation Feature Importance to the Data Generating Process

Sep 03, 2021

Christoph Molnar, Timo Freiesleben, Gunnar König, Giuseppe Casalicchio, Marvin N. Wright, Bernd Bischl

Figure 1 for Relating the Partial Dependence Plot and Permutation Feature Importance to the Data Generating Process

Figure 2 for Relating the Partial Dependence Plot and Permutation Feature Importance to the Data Generating Process

Figure 3 for Relating the Partial Dependence Plot and Permutation Feature Importance to the Data Generating Process

Figure 4 for Relating the Partial Dependence Plot and Permutation Feature Importance to the Data Generating Process

Abstract:Scientists and practitioners increasingly rely on machine learning to model data and draw conclusions. Compared to statistical modeling approaches, machine learning makes fewer explicit assumptions about data structures, such as linearity. However, their model parameters usually cannot be easily related to the data generating process. To learn about the modeled relationships, partial dependence (PD) plots and permutation feature importance (PFI) are often used as interpretation methods. However, PD and PFI lack a theory that relates them to the data generating process. We formalize PD and PFI as statistical estimators of ground truth estimands rooted in the data generating process. We show that PD and PFI estimates deviate from this ground truth due to statistical biases, model variance and Monte Carlo approximation errors. To account for model variance in PD and PFI estimation, we propose the learner-PD and the learner-PFI based on model refits, and propose corrected variance and confidence interval estimators.

Via

Access Paper or Ask Questions

A Causal Perspective on Meaningful and Robust Algorithmic Recourse

Jul 16, 2021

Gunnar König, Timo Freiesleben, Moritz Grosse-Wentrup

Figure 1 for A Causal Perspective on Meaningful and Robust Algorithmic Recourse

Figure 2 for A Causal Perspective on Meaningful and Robust Algorithmic Recourse

Abstract:Algorithmic recourse explanations inform stakeholders on how to act to revert unfavorable predictions. However, in general ML models do not predict well in interventional distributions. Thus, an action that changes the prediction in the desired way may not lead to an improvement of the underlying target. Such recourse is neither meaningful nor robust to model refits. Extending the work of Karimi et al. (2021), we propose meaningful algorithmic recourse (MAR) that only recommends actions that improve both prediction and target. We justify this selection constraint by highlighting the differences between model audit and meaningful, actionable recourse explanations. Additionally, we introduce a relaxation of MAR called effective algorithmic recourse (EAR), which, under certain assumptions, yields meaningful recourse by only allowing interventions on causes of the target.

* ICML (International Conference on Machine Learning) Workshop on Algorithmic Recourse

Via

Access Paper or Ask Questions