Elodie Germani

EMPENN, LACODAM

Bias and Generalizability of Foundation Models across Datasets in Breast Mammography

May 19, 2025

Elodie Germani, Ilayda Selin Türk, Fatima Zeineddine, Charbel Mourad, Shadi Albarqouni

Abstract: Over the past decades, computer-aided diagnosis tools for breast cancer have been developed to enhance screening procedures, yet their clinical adoption remains challenged by data variability and inherent biases. Although foundation models (FMs) have recently demonstrated impressive generalizability and transfer learning capabilities by leveraging vast and diverse datasets, their performance can be undermined by spurious correlations that arise from variations in image quality, labeling uncertainty, and sensitive patient attributes. In this work, we explore the fairness and bias of FMs for breast mammography classification by leveraging a large pool of datasets from diverse sources, including data from underrepresented regions and an in-house dataset. Our extensive experiments show that while modality-specific pre-training of FMs enhances performance, classifiers trained on features from individual datasets fail to generalize across domains. Aggregating datasets improves overall performance, yet does not fully mitigate biases, leading to significant disparities across underrepresented subgroups such as extreme breast densities and age groups. Furthermore, while domain-adaptation strategies can reduce these disparities, they often incur a performance trade-off. In contrast, fairness-aware techniques yield more stable and equitable performance across subgroups. These findings underscore the necessity of incorporating rigorous fairness evaluations and mitigation strategies into FM-based models to foster inclusive and generalizable AI.

* Accepted at the International Conference on Medical Image Computing and Computer-Assisted Intervention (MICCAI) 2025

Mitigating analytical variability in fMRI results with style transfer

Apr 04, 2024

Elodie Germani, Elisa Fromont, Camille Maumet

Abstract: We propose a novel approach to improve the reproducibility of neuroimaging results by converting statistic maps across different functional MRI pipelines. We assume that pipelines can be considered a style component of the data and propose to use different generative models, including Diffusion Models (DM), to convert data between pipelines. We design a new DM-based unsupervised multi-domain image-to-image transition framework and constrain the generation of 3D fMRI statistic maps using the latent space of an auxiliary classifier that distinguishes statistic maps from different pipelines. We extend traditional sampling techniques used in DM to improve the transition performance. Our experiments demonstrate that our proposed methods are successful: pipelines can indeed be transferred, providing an important source of data augmentation for future medical studies.

Uncovering communities of pipelines in the task-fMRI analytical space

Dec 11, 2023

Elodie Germani, Elisa Fromont, Camille Maumet

Abstract: Functional magnetic resonance imaging analytical workflows are highly flexible, with no definite consensus on how to choose a pipeline. While methods have been developed to explore this analytical space, there is still a lack of understanding of the relationships between the different pipelines. We use community detection algorithms to explore the pipeline space and assess its stability across different contexts. We show that there are subsets of pipelines that give similar results, especially those sharing specific parameters (e.g., number of motion regressors or software package), with relative stability across groups of participants. By visualizing the differences between these subsets, we describe the effect of pipeline parameters and derive general relationships in the analytical space.

On the benefits of self-taught learning for brain decoding

Sep 19, 2022

Elodie Germani, Elisa Fromont, Camille Maumet

Abstract: We study the benefits of using a large public neuroimaging database composed of fMRI statistic maps, in a self-taught learning framework, for improving brain decoding on new tasks. First, we leverage the NeuroVault database to train, on a selection of relevant statistic maps, a convolutional autoencoder to reconstruct these maps. Then, we use this trained encoder to initialize a supervised convolutional neural network to classify tasks or cognitive processes of unseen statistic maps from large collections of the NeuroVault database. We show that such a self-taught learning process always improves the performance of the classifiers, but the magnitude of the benefits strongly depends on the amount of data available for both pre-training and fine-tuning the models and on the complexity of the targeted downstream task.
