Anders Gjølbye

Missing-Data-Induced Phase Transitions in Spectral PLS for Multimodal Learning

Jan 29, 2026

Anders Gjølbye, Ida Kargaard, Emma Kargaard, Lars Kai Hansen

Abstract: Partial Least Squares (PLS) learns shared structure from paired data via the top singular vectors of the empirical cross-covariance (PLS-SVD), but multimodal datasets often have missing entries in both views. We study PLS-SVD under independent entry-wise missing-completely-at-random masking in a proportional high-dimensional spiked model. After appropriate normalization, the masked cross-covariance behaves like a spiked rectangular random matrix whose effective signal strength is attenuated by $\sqrt{\rho}$, where $\rho$ is the joint entry retention probability. As a result, PLS-SVD exhibits a sharp BBP-type phase transition: below a critical signal-to-noise threshold the leading singular vectors are asymptotically uninformative, while above it they achieve nontrivial alignment with the latent shared directions, with closed-form asymptotic overlap formulas. Simulations and semi-synthetic multimodal experiments corroborate the predicted phase diagram and recovery curves across aspect ratios, signal strengths, and missingness levels.

* Preprint
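As a rough illustration of the setting described in the abstract above, the following sketch computes PLS-SVD on zero-filled, independently masked views drawn from a rank-one spiked model. The data model, the parameter names (`theta`, `keep_x`, `keep_y`), and the plain 1/n normalization are assumptions made for this example; they are not taken from the paper, and the paper's exact normalization and threshold formulas are not reproduced here.

```python
import numpy as np

def masked_pls_svd(X, Y, mask_x, mask_y):
    """Top singular triplet of the cross-covariance of zero-filled masked views."""
    n = X.shape[0]
    Xm = np.where(mask_x, X, 0.0)  # missing entries replaced by zero
    Ym = np.where(mask_y, Y, 0.0)
    C = Xm.T @ Ym / n              # empirical masked cross-covariance (p x q)
    U, s, Vt = np.linalg.svd(C, full_matrices=False)
    return U[:, 0], s[0], Vt[0]

def simulate(n=2000, p=200, q=200, theta=3.0, keep_x=0.8, keep_y=0.8, seed=0):
    """Return |<u_hat, u>| and |<v_hat, v>| for one draw of an assumed spiked model."""
    rng = np.random.default_rng(seed)
    u = rng.standard_normal(p)
    u /= np.linalg.norm(u)
    v = rng.standard_normal(q)
    v /= np.linalg.norm(v)
    z = rng.standard_normal(n)     # shared latent variable
    X = np.sqrt(theta) * np.outer(z, u) + rng.standard_normal((n, p))
    Y = np.sqrt(theta) * np.outer(z, v) + rng.standard_normal((n, q))
    # Independent entry-wise MCAR masks; True means the entry is observed.
    mask_x = rng.random((n, p)) < keep_x
    mask_y = rng.random((n, q)) < keep_y
    u_hat, _, v_hat = masked_pls_svd(X, Y, mask_x, mask_y)
    return abs(u_hat @ u), abs(v_hat @ v)

if __name__ == "__main__":
    for theta in (0.0, 0.5, 3.0):
        print(theta, simulate(theta=theta))
```

Sweeping `theta` or the retention probabilities in `simulate` gives an informal picture of how alignment with the planted directions degrades as more entries go missing.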

Large Vision Models Can Solve Mental Rotation Problems

Sep 18, 2025

Sebastian Ray Mason, Anders Gjølbye, Phillip Chavarria Højbjerg, Lenka Tětková, Lars Kai Hansen

Abstract: Mental rotation is a key test of spatial reasoning in humans and has been central to understanding how perception supports cognition. Despite the success of modern vision transformers, it is still unclear how well these models develop similar abilities. In this work, we present a systematic evaluation of ViT, CLIP, DINOv2, and DINOv3 across a range of mental-rotation tasks, from simple block structures similar to those used by Shepard and Metzler to study human cognition, to more complex block figures, three types of text, and photo-realistic objects. By probing model representations layer by layer, we examine where and how these networks succeed. We find that i) self-supervised ViTs capture geometric structure better than supervised ViTs; ii) intermediate layers perform better than final layers; iii) task difficulty increases with rotation complexity and occlusion, mirroring human reaction times and suggesting similar constraints in embedding space representations.

Minimizing False-Positive Attributions in Explanations of Non-Linear Models

May 16, 2025

Anders Gjølbye, Stefan Haufe, Lars Kai Hansen

Abstract: Suppressor variables can influence model predictions without being dependent on the target outcome, and they pose a significant challenge for Explainable AI (XAI) methods. These variables may cause false-positive feature attributions, undermining the utility of explanations. Although effective remedies exist for linear models, their extension to non-linear models and to instance-based explanations has remained limited. We introduce PatternLocal, a novel XAI technique that addresses this gap. PatternLocal begins with a locally linear surrogate, e.g. LIME, KernelSHAP, or gradient-based methods, and transforms the resulting discriminative model weights into a generative representation, thereby suppressing the influence of suppressor variables while preserving local fidelity. In extensive hyperparameter optimization on the XAI-TRIS benchmark, PatternLocal consistently outperformed other XAI methods and reduced false-positive attributions when explaining non-linear tasks, thereby enabling more reliable and actionable insights.

* Preprint. Under review
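To make the idea of turning discriminative weights into a generative representation more concrete, here is a minimal sketch of the classic linear-model version of that transformation: multiplying the weight vector by the data covariance. This is not the authors' PatternLocal implementation. The function name `weights_to_pattern`, the optional `sample_weight` argument (standing in for hypothetical locality-kernel weights from a surrogate such as LIME), and the toy suppressor data are all assumptions made for illustration.

```python
import numpy as np

def weights_to_pattern(X, w, sample_weight=None):
    """Map linear weights w to a covariance-based pattern, cov(X) @ w, rescaled.

    sample_weight is a hypothetical hook for locality-kernel weights;
    None gives the ordinary (global) covariance.
    """
    X = np.asarray(X, dtype=float)
    w = np.asarray(w, dtype=float)
    cov = np.cov(X, rowvar=False, aweights=sample_weight)
    a = cov @ w
    return a / (w @ a)  # divide by the variance of the linear output w^T x

# Toy suppressor setup (assumed for illustration): feature 0 mixes signal and
# distractor, feature 1 is the distractor alone, and the target is the signal.
rng = np.random.default_rng(0)
n = 5000
signal = rng.standard_normal(n)
distractor = rng.standard_normal(n)
X = np.column_stack([signal + distractor, distractor])
y = signal

# A least-squares fit uses the suppressor (feature 1) to cancel the distractor.
w, *_ = np.linalg.lstsq(X, y, rcond=None)
pattern = weights_to_pattern(X, w)

if __name__ == "__main__":
    print("weights:", w)
    print("pattern:", pattern)
```

In this toy case the fitted weights assign a large magnitude to the suppressor feature, while the pattern for that feature is close to zero because it is nearly uncorrelated with the target.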

SPEED: Scalable Preprocessing of EEG Data for Self-Supervised Learning

Aug 15, 2024

Anders Gjølbye, Lina Skerath, William Lehn-Schiøler, Nicolas Langer, Lars Kai Hansen

Abstract: Electroencephalography (EEG) research typically focuses on tasks with narrowly defined objectives, but recent studies are expanding into the use of unlabeled data within larger models, aiming for a broader range of applications. This addresses a critical challenge in EEG research. For example, Kostas et al. (2021) show that self-supervised learning (SSL) outperforms traditional supervised methods. Given the high noise levels in EEG data, we argue that further improvements are possible with additional preprocessing. Current preprocessing methods often fail to efficiently manage the large data volumes required for SSL, due to their lack of optimization, reliance on subjective manual corrections, and validation processes or inflexible protocols that limit SSL. We propose a Python-based EEG preprocessing pipeline optimized for self-supervised learning, designed to efficiently process large-scale data. This optimization not only stabilizes self-supervised training but also enhances performance on downstream tasks compared to training with raw data.

* To appear in proceedings of 2024 IEEE International Workshop on Machine Learning for Signal Processing
