Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Fabiano Veglianti

Generalizability vs. Counterfactual Explainability Trade-Off

May 29, 2025

Fabiano Veglianti, Flavio Giorgi, Fabrizio Silvestri, Gabriele Tolomei

Abstract:In this work, we investigate the relationship between model generalization and counterfactual explainability in supervised learning. We introduce the notion of $\varepsilon$-valid counterfactual probability ($\varepsilon$-VCP) -- the probability of finding perturbations of a data point within its $\varepsilon$-neighborhood that result in a label change. We provide a theoretical analysis of $\varepsilon$-VCP in relation to the geometry of the model's decision boundary, showing that $\varepsilon$-VCP tends to increase with model overfitting. Our findings establish a rigorous connection between poor generalization and the ease of counterfactual generation, revealing an inherent trade-off between generalization and counterfactual explainability. Empirical results validate our theory, suggesting $\varepsilon$-VCP as a practical proxy for quantitatively characterizing overfitting.

* 9 pages, 4 figures, plus appendix. arXiv admin note: text overlap with arXiv:2502.09193

Via

Access Paper or Ask Questions

Generalizability through Explainability: Countering Overfitting with Counterfactual Examples

Feb 13, 2025

Flavio Giorgi, Fabiano Veglianti, Fabrizio Silvestri, Gabriele Tolomei

Figure 1 for Generalizability through Explainability: Countering Overfitting with Counterfactual Examples

Figure 2 for Generalizability through Explainability: Countering Overfitting with Counterfactual Examples

Figure 3 for Generalizability through Explainability: Countering Overfitting with Counterfactual Examples

Figure 4 for Generalizability through Explainability: Countering Overfitting with Counterfactual Examples

Abstract:Overfitting is a well-known issue in machine learning that occurs when a model struggles to generalize its predictions to new, unseen data beyond the scope of its training set. Traditional techniques to mitigate overfitting include early stopping, data augmentation, and regularization. In this work, we demonstrate that the degree of overfitting of a trained model is correlated with the ability to generate counterfactual examples. The higher the overfitting, the easier it will be to find a valid counterfactual example for a randomly chosen input data point. Therefore, we introduce CF-Reg, a novel regularization term in the training loss that controls overfitting by ensuring enough margin between each instance and its corresponding counterfactual. Experiments conducted across multiple datasets and models show that our counterfactual regularizer generally outperforms existing regularization techniques.

Via

Access Paper or Ask Questions

Effective Non-Random Extreme Learning Machine

Nov 25, 2024

Daniela De Canditiis, Fabiano Veglianti

Abstract:The Extreme Learning Machine (ELM) is a growing statistical technique widely applied to regression problems. In essence, ELMs are single-layer neural networks where the hidden layer weights are randomly sampled from a specific distribution, while the output layer weights are learned from the data. Two of the key challenges with this approach are the architecture design, specifically determining the optimal number of neurons in the hidden layer, and the method's sensitivity to the random initialization of hidden layer weights. This paper introduces a new and enhanced learning algorithm for regression tasks, the Effective Non-Random ELM (ENR-ELM), which simplifies the architecture design and eliminates the need for random hidden layer weight selection. The proposed method incorporates concepts from signal processing, such as basis functions and projections, into the ELM framework. We introduce two versions of the ENR-ELM: the approximated ENR-ELM and the incremental ENR-ELM. Experimental results on both synthetic and real datasets demonstrate that our method overcomes the problems of traditional ELM while maintaining comparable predictive performance.

Via

Access Paper or Ask Questions