Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Ben Glocker

Biomedical Image Analysis Group, Department of Computing, Imperial College London

Identifiable Object-Centric Representation Learning via Probabilistic Slot Attention

Jun 11, 2024

Avinash Kori, Francesco Locatello, Ainkaran Santhirasekaram, Francesca Toni, Ben Glocker, Fabio De Sousa Ribeiro

Figure 1 for Identifiable Object-Centric Representation Learning via Probabilistic Slot Attention

Figure 2 for Identifiable Object-Centric Representation Learning via Probabilistic Slot Attention

Figure 3 for Identifiable Object-Centric Representation Learning via Probabilistic Slot Attention

Figure 4 for Identifiable Object-Centric Representation Learning via Probabilistic Slot Attention

Abstract:Learning modular object-centric representations is crucial for systematic generalization. Existing methods show promising object-binding capabilities empirically, but theoretical identifiability guarantees remain relatively underdeveloped. Understanding when object-centric representations can theoretically be identified is crucial for scaling slot-based methods to high-dimensional images with correctness guarantees. To that end, we propose a probabilistic slot-attention algorithm that imposes an aggregate mixture prior over object-centric slot representations, thereby providing slot identifiability guarantees without supervision, up to an equivalence relation. We provide empirical verification of our theoretical identifiability result using both simple 2-dimensional data and high-resolution imaging datasets.

Via

Access Paper or Ask Questions

The Importance of Model Inspection for Better Understanding Performance Characteristics of Graph Neural Networks

May 02, 2024

Nairouz Shehata, Carolina Piçarra, Anees Kazi, Ben Glocker

Figure 1 for The Importance of Model Inspection for Better Understanding Performance Characteristics of Graph Neural Networks

Figure 2 for The Importance of Model Inspection for Better Understanding Performance Characteristics of Graph Neural Networks

Figure 3 for The Importance of Model Inspection for Better Understanding Performance Characteristics of Graph Neural Networks

Abstract:This study highlights the importance of conducting comprehensive model inspection as part of comparative performance analyses. Here, we investigate the effect of modelling choices on the feature learning characteristics of graph neural networks applied to a brain shape classification task. Specifically, we analyse the effect of using parameter-efficient, shared graph convolutional submodels compared to structure-specific, non-shared submodels. Further, we assess the effect of mesh registration as part of the data harmonisation pipeline. We find substantial differences in the feature embeddings at different layers of the models. Our results highlight that test accuracy alone is insufficient to identify important model characteristics such as encoded biases related to data source or potentially non-discriminative features learned in submodels. Our model inspection framework offers a valuable tool for practitioners to better understand performance characteristics of deep learning models in medical imaging.

* International Symposium on Biomedical Imaging (ISBI)

Via

Access Paper or Ask Questions

Counterfactual contrastive learning: robust representations via causal image synthesis

Mar 14, 2024

Melanie Roschewitz, Fabio De Sousa Ribeiro, Tian Xia, Galvin Khara, Ben Glocker

Abstract:Contrastive pretraining is well-known to improve downstream task performance and model generalisation, especially in limited label settings. However, it is sensitive to the choice of augmentation pipeline. Positive pairs should preserve semantic information while destroying domain-specific information. Standard augmentation pipelines emulate domain-specific changes with pre-defined photometric transformations, but what if we could simulate realistic domain changes instead? In this work, we show how to utilise recent progress in counterfactual image generation to this effect. We propose CF-SimCLR, a counterfactual contrastive learning approach which leverages approximate counterfactual inference for positive pair creation. Comprehensive evaluation across five datasets, on chest radiography and mammography, demonstrates that CF-SimCLR substantially improves robustness to acquisition shift with higher downstream performance on both in- and out-of-distribution data, particularly for domains which are under-represented during training.

* Code available at https://github.com/biomedia-mira/counterfactual-contrastive

Via

Access Paper or Ask Questions

Mitigating attribute amplification in counterfactual image generation

Mar 14, 2024

Tian Xia, Mélanie Roschewitz, Fabio De Sousa Ribeiro, Charles Jones, Ben Glocker

Figure 1 for Mitigating attribute amplification in counterfactual image generation

Figure 2 for Mitigating attribute amplification in counterfactual image generation

Figure 3 for Mitigating attribute amplification in counterfactual image generation

Figure 4 for Mitigating attribute amplification in counterfactual image generation

Abstract:Causal generative modelling is gaining interest in medical imaging due to its ability to answer interventional and counterfactual queries. Most work focuses on generating counterfactual images that look plausible, using auxiliary classifiers to enforce effectiveness of simulated interventions. We investigate pitfalls in this approach, discovering the issue of attribute amplification, where unrelated attributes are spuriously affected during interventions, leading to biases across protected characteristics and disease status. We show that attribute amplification is caused by the use of hard labels in the counterfactual training process and propose soft counterfactual fine-tuning to mitigate this issue. Our method substantially reduces the amplification effect while maintaining effectiveness of generated images, demonstrated on a large chest X-ray dataset. Our work makes an important advancement towards more faithful and unbiased causal modelling in medical imaging.

Via

Access Paper or Ask Questions

Average Calibration Error: A Differentiable Loss for Improved Reliability in Image Segmentation

Mar 11, 2024

Theodore Barfoot, Luis Garcia-Peraza-Herrera, Ben Glocker, Tom Vercauteren

Abstract:Deep neural networks for medical image segmentation often produce overconfident results misaligned with empirical observations. Such miscalibration, challenges their clinical translation. We propose to use marginal L1 average calibration error (mL1-ACE) as a novel auxiliary loss function to improve pixel-wise calibration without compromising segmentation quality. We show that this loss, despite using hard binning, is directly differentiable, bypassing the need for approximate but differentiable surrogate or soft binning approaches. Our work also introduces the concept of dataset reliability histograms which generalises standard reliability diagrams for refined visual assessment of calibration in semantic segmentation aggregated at the dataset level. Using mL1-ACE, we reduce average and maximum calibration error by 45% and 55% respectively, maintaining a Dice score of 87% on the BraTS 2021 dataset. We share our code here: https://github.com/cai4cai/ACE-DLIRIS

Via

Access Paper or Ask Questions

Demystifying Variational Diffusion Models

Jan 11, 2024

Fabio De Sousa Ribeiro, Ben Glocker

Figure 1 for Demystifying Variational Diffusion Models

Figure 2 for Demystifying Variational Diffusion Models

Figure 3 for Demystifying Variational Diffusion Models

Figure 4 for Demystifying Variational Diffusion Models

Abstract:Despite the growing popularity of diffusion models, gaining a deep understanding of the model class remains somewhat elusive for the uninitiated in non-equilibrium statistical physics. With that in mind, we present what we believe is a more straightforward introduction to diffusion models using directed graphical modelling and variational Bayesian principles, which imposes relatively fewer prerequisites on the average reader. Our exposition constitutes a comprehensive technical review spanning from foundational concepts like deep latent variable models to recent advances in continuous-time diffusion-based modelling, highlighting theoretical connections between model classes along the way. We provide additional mathematical insights that were omitted in the seminal works whenever possible to aid in understanding, while avoiding the introduction of new notation. We envision this article serving as a useful educational supplement for both researchers and practitioners in the area, and we welcome feedback and contributions from the community at https://github.com/biomedia-mira/demystifying-diffusion.

Via

Access Paper or Ask Questions

Robust semi-supervised segmentation with timestep ensembling diffusion models

Nov 13, 2023

Margherita Rosnati, Melanie Roschewitz, Ben Glocker

Figure 1 for Robust semi-supervised segmentation with timestep ensembling diffusion models

Figure 2 for Robust semi-supervised segmentation with timestep ensembling diffusion models

Figure 3 for Robust semi-supervised segmentation with timestep ensembling diffusion models

Figure 4 for Robust semi-supervised segmentation with timestep ensembling diffusion models

Abstract:Medical image segmentation is a challenging task, made more difficult by many datasets' limited size and annotations. Denoising diffusion probabilistic models (DDPM) have recently shown promise in modelling the distribution of natural images and were successfully applied to various medical imaging tasks. This work focuses on semi-supervised image segmentation using diffusion models, particularly addressing domain generalisation. Firstly, we demonstrate that smaller diffusion steps generate latent representations that are more robust for downstream tasks than larger steps. Secondly, we use this insight to propose an improved esembling scheme that leverages information-dense small steps and the regularising effect of larger steps to generate predictions. Our model shows significantly better performance in domain-shifted settings while retaining competitive performance in-domain. Overall, this work highlights the potential of DDPMs for semi-supervised medical image segmentation and provides insights into optimising their performance under domain shift.

* Published at Machine Learning for Health (ML4H) 2023, presented at Medical Imaging meets NeurIPS 2023 and Deep Generative Models for Health Workshop NeurIPS 2023

Via

Access Paper or Ask Questions

Analysing race and sex bias in brain age prediction

Sep 19, 2023

Carolina Piçarra, Ben Glocker

Abstract:Brain age prediction from MRI has become a popular imaging biomarker associated with a wide range of neuropathologies. The datasets used for training, however, are often skewed and imbalanced regarding demographics, potentially making brain age prediction models susceptible to bias. We analyse the commonly used ResNet-34 model by conducting a comprehensive subgroup performance analysis and feature inspection. The model is trained on 1,215 T1-weighted MRI scans from Cam-CAN and IXI, and tested on UK Biobank (n=42,786), split into six racial and biological sex subgroups. With the objective of comparing the performance between subgroups, measured by the absolute prediction error, we use a Kruskal-Wallis test followed by two post-hoc Conover-Iman tests to inspect bias across race and biological sex. To examine biases in the generated features, we use PCA for dimensionality reduction and employ two-sample Kolmogorov-Smirnov tests to identify distribution shifts among subgroups. Our results reveal statistically significant differences in predictive performance between Black and White, Black and Asian, and male and female subjects. Seven out of twelve pairwise comparisons show statistically significant differences in the feature distributions. Our findings call for further analysis of brain age prediction models.

* MICCAI Workshop on Fairness of AI in Medical Imaging (FAIMI 2023)

Via

Access Paper or Ask Questions

Robustness Stress Testing in Medical Image Classification

Aug 14, 2023

Mobarakol Islam, Zeju Li, Ben Glocker

Abstract:Deep neural networks have shown impressive performance for image-based disease detection. Performance is commonly evaluated through clinical validation on independent test sets to demonstrate clinically acceptable accuracy. Reporting good performance metrics on test sets, however, is not always a sufficient indication of the generalizability and robustness of an algorithm. In particular, when the test data is drawn from the same distribution as the training data, the iid test set performance can be an unreliable estimate of the accuracy on new data. In this paper, we employ stress testing to assess model robustness and subgroup performance disparities in disease detection models. We design progressive stress testing using five different bidirectional and unidirectional image perturbations with six different severity levels. As a use case, we apply stress tests to measure the robustness of disease detection models for chest X-ray and skin lesion images, and demonstrate the importance of studying class and domain-specific model behaviour. Our experiments indicate that some models may yield more robust and equitable performance than others. We also find that pretraining characteristics play an important role in downstream robustness. We conclude that progressive stress testing is a viable and important tool and should become standard practice in the clinical validation of image-based disease detection models.

* 11 pages

Via

Access Paper or Ask Questions

Distance Matters For Improving Performance Estimation Under Covariate Shift

Aug 14, 2023

Mélanie Roschewitz, Ben Glocker

Figure 1 for Distance Matters For Improving Performance Estimation Under Covariate Shift

Figure 2 for Distance Matters For Improving Performance Estimation Under Covariate Shift

Figure 3 for Distance Matters For Improving Performance Estimation Under Covariate Shift

Figure 4 for Distance Matters For Improving Performance Estimation Under Covariate Shift

Abstract:Performance estimation under covariate shift is a crucial component of safe AI model deployment, especially for sensitive use-cases. Recently, several solutions were proposed to tackle this problem, most leveraging model predictions or softmax confidence to derive accuracy estimates. However, under dataset shifts, confidence scores may become ill-calibrated if samples are too far from the training distribution. In this work, we show that taking into account distances of test samples to their expected training distribution can significantly improve performance estimation under covariate shift. Precisely, we introduce a "distance-check" to flag samples that lie too far from the expected distribution, to avoid relying on their untrustworthy model outputs in the accuracy estimation step. We demonstrate the effectiveness of this method on 13 image classification tasks, across a wide-range of natural and synthetic distribution shifts and hundreds of models, with a median relative MAE improvement of 27% over the best baseline across all tasks, and SOTA performance on 10 out of 13 tasks. Our code is publicly available at https://github.com/melanibe/distance_matters_performance_estimation.

* Accepted to ICCV Workshop on Uncertainty Quantification for Computer Vision 2023

Via

Access Paper or Ask Questions