Abstract: Tabular foundation models have exhibited strong in-context learning (ICL) capabilities on structured data, allowing them to make accurate predictions on test sets without parameter updates, using training examples as context. This emerging approach positions itself as a competitive alternative to traditional gradient-boosted tree methods. However, while biases in conventional machine learning models are well documented, it remains unclear how such biases manifest in tabular ICL. This paper investigates the fairness implications of tabular ICL and explores three preprocessing strategies to mitigate bias: correlation removal, group-balanced demonstration selection, and uncertainty-based demonstration selection. Comprehensive experiments indicate that uncertainty-based demonstration selection consistently enhances the group fairness of in-context predictions. The source code for reproducing the results of this work is available at https://github.com/patrikken/Fair-TabICL.
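To make the demonstration-selection idea concrete, below is a minimal sketch of uncertainty-based demonstration selection. The abstract does not specify the scoring rule, so the use of a proxy classifier's predictive entropy, the `select_demos` helper, and the budget `k` are illustrative assumptions rather than the paper's actual procedure.

```python
# A minimal sketch, assuming uncertainty is measured as the predictive
# entropy of a cheap proxy classifier; the paper's exact criterion may differ.
import numpy as np
from sklearn.linear_model import LogisticRegression

def select_demos(X_train, y_train, k=32):
    """Pick the k training rows whose proxy-model predictions are most
    confident (lowest entropy) to serve as in-context demonstrations."""
    proxy = LogisticRegression(max_iter=1000).fit(X_train, y_train)
    probs = proxy.predict_proba(X_train)
    entropy = -np.sum(probs * np.log(probs + 1e-12), axis=1)  # per-row uncertainty
    idx = np.argsort(entropy)[:k]  # keep the lowest-entropy rows
    return X_train[idx], y_train[idx]

# Usage on toy data: the selected pairs would then be serialized as the ICL
# context of a tabular foundation model (e.g., a TabPFN-style predictor).
X = np.random.randn(500, 8)
y = (X[:, 0] + 0.5 * np.random.randn(500) > 0).astype(int)
X_demo, y_demo = select_demos(X, y, k=16)
print(X_demo.shape, y_demo.shape)  # (16, 8) (16,)
```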
Abstract: Neural networks can learn spurious correlations in the data, often leading to performance disparities for underrepresented subgroups. Studies have demonstrated that this disparity is amplified when knowledge is distilled from a complex teacher model to a relatively "simple" student model. Prior work has shown that ensemble deep learning methods can improve the performance of the worst-case subgroups; however, it is unclear whether this advantage carries over when distilling knowledge from an ensemble of teachers, especially when the teacher models are debiased. This study demonstrates that traditional ensemble knowledge distillation can significantly degrade the performance of the worst-case subgroups in the distilled student model, even when the teacher models are debiased. To overcome this, we propose Adaptive Group Robust Ensemble Knowledge Distillation (AGRE-KD), a simple ensembling strategy that ensures the student model receives knowledge beneficial to unknown underrepresented subgroups. Leveraging an additional biased model, our method selectively chooses teachers whose knowledge better improves the worst-performing subgroups by upweighting teachers whose gradient directions deviate from that of the biased model. Our experiments on several datasets demonstrate the superiority of the proposed ensemble distillation technique and show that it can even outperform classic model ensembles based on majority voting.
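The teacher-weighting step can be illustrated with a short PyTorch sketch. The abstract only states that teachers are upweighted when their gradient directions deviate from the biased model's, so the cosine-similarity weighting, the distillation temperature `T`, and the `teacher_weights` helper below are assumptions for illustration, not the paper's implementation.

```python
# A minimal sketch, assuming each teacher's weight is proportional to
# 1 - cos(g_teacher, g_bias), where g_* is the gradient of the student's
# distillation loss toward that model's soft targets.
import torch
import torch.nn.functional as F

def flat_grad(loss, params):
    """Flatten the gradient of `loss` w.r.t. `params` into one vector."""
    grads = torch.autograd.grad(loss, params, retain_graph=True)
    return torch.cat([g.reshape(-1) for g in grads])

def teacher_weights(student, teachers, biased, x, T=2.0):
    """Upweight teachers whose distillation gradient deviates from the
    gradient induced by the additional biased model."""
    params = [p for p in student.parameters() if p.requires_grad]
    s_log_probs = F.log_softmax(student(x) / T, dim=1)
    with torch.no_grad():
        b_probs = F.softmax(biased(x) / T, dim=1)
    g_bias = flat_grad(F.kl_div(s_log_probs, b_probs, reduction="batchmean"), params)
    weights = []
    for t in teachers:
        with torch.no_grad():
            t_probs = F.softmax(t(x) / T, dim=1)
        g_t = flat_grad(F.kl_div(s_log_probs, t_probs, reduction="batchmean"), params)
        sim = F.cosine_similarity(g_t, g_bias, dim=0)
        weights.append(1.0 - sim)  # larger weight when deviating from the bias
    w = torch.stack(weights)
    return w / w.sum()

# Usage with toy models: in a full training loop, these weights would combine
# the teachers' soft targets into the distillation objective for each update.
student = torch.nn.Linear(10, 3)
teachers = [torch.nn.Linear(10, 3) for _ in range(4)]
biased = torch.nn.Linear(10, 3)
x = torch.randn(8, 10)
print(teacher_weights(student, teachers, biased, x))
```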