Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Michael Riis Andersen

Same Graph, Different Likelihoods: Calibration of Autoregressive Graph Generators via Permutation-Equivalent Encodings

Apr 07, 2026

Laurits Fredsgaard, Aaron Thomas, Michael Riis Andersen, Mikkel N. Schmidt, Mahito Sugiyama

Abstract:Autoregressive graph generators define likelihoods via a sequential construction process, but these likelihoods are only meaningful if they are consistent across all linearizations of the same graph. Segmented Eulerian Neighborhood Trails (SENT), a recent linearization method, converts graphs into sequences that can be perfectly decoded and efficiently processed by language models, but admit multiple equivalent linearizations of the same graph. We quantify violations in assigned negative log-likelihood (NLL) using the coefficient of variation across equivalent linearizations, which we call Linearization Uncertainty (LU). Training transformers under four linearization strategies on two datasets, we show that biased orderings achieve lower NLL on their native order but exhibit expected calibration error (ECE) two orders of magnitude higher under random permutation, indicating that these models have learned their training linearization rather than the underlying graph. On the molecular graph benchmark QM9, NLL for generated graphs is negatively correlated with molecular stability (AUC $=0.43$), while LU achieves AUC $=0.85$, suggesting that permutation-based evaluation provides a more reliable quality check for generated molecules. Code is available at https://github.com/lauritsf/linearization-uncertainty

* Workshop 'Towards Trustworthy Predictions: Theory and Applications of Calibration for Modern AI' at AISTATS 2026, Tangier, Morocco

Via

Access Paper or Ask Questions

Practical Deep Heteroskedastic Regression

Mar 02, 2026

Mikkel Jordahn, Jonas Vestergaard Jensen, James Harrison, Michael Riis Andersen, Mikkel N. Schmidt

Abstract:Uncertainty quantification (UQ) in deep learning regression is of wide interest, as it supports critical applications including sequential decision making and risk-sensitive tasks. In heteroskedastic regression, where the uncertainty of the target depends on the input, a common approach is to train a neural network that parameterizes the mean and the variance of the predictive distribution. Still, training deep heteroskedastic regression models poses practical challenges in the trade-off between uncertainty quantification and mean prediction, such as optimization difficulties, representation collapse, and variance overfitting. In this work we identify previously undiscussed fallacies and propose a simple and efficient procedure that addresses these challenges jointly by post-hoc fitting a variance model across the intermediate layers of a pretrained network on a hold-out dataset. We demonstrate that our method achieves on-par or state-of-the-art uncertainty quantification on several molecular graph datasets, without compromising mean prediction accuracy and remaining cheap to use at prediction time.

Via

Access Paper or Ask Questions

On Local Posterior Structure in Deep Ensembles

Mar 17, 2025

Mikkel Jordahn, Jonas Vestergaard Jensen, Mikkel N. Schmidt, Michael Riis Andersen

Abstract:Bayesian Neural Networks (BNNs) often improve model calibration and predictive uncertainty quantification compared to point estimators such as maximum-a-posteriori (MAP). Similarly, deep ensembles (DEs) are also known to improve calibration, and therefore, it is natural to hypothesize that deep ensembles of BNNs (DE-BNNs) should provide even further improvements. In this work, we systematically investigate this across a number of datasets, neural network architectures, and BNN approximation methods and surprisingly find that when the ensembles grow large enough, DEs consistently outperform DE-BNNs on in-distribution data. To shine light on this observation, we conduct several sensitivity and ablation studies. Moreover, we show that even though DE-BNNs outperform DEs on out-of-distribution metrics, this comes at the cost of decreased in-distribution performance. As a final contribution, we open-source the large pool of trained models to facilitate further research on this topic.

* Code and models available at https://github.com/jonasvj/OnLocalPosteriorStructureInDeepEnsembles

Via

Access Paper or Ask Questions

GeoFormer: A Multi-Polygon Segmentation Transformer

Nov 25, 2024

Maxim Khomiakov, Michael Riis Andersen, Jes Frellsen

Figure 1 for GeoFormer: A Multi-Polygon Segmentation Transformer

Figure 2 for GeoFormer: A Multi-Polygon Segmentation Transformer

Figure 3 for GeoFormer: A Multi-Polygon Segmentation Transformer

Figure 4 for GeoFormer: A Multi-Polygon Segmentation Transformer

Abstract:In remote sensing there exists a common need for learning scale invariant shapes of objects like buildings. Prior works relies on tweaking multiple loss functions to convert segmentation maps into the final scale invariant representation, necessitating arduous design and optimization. For this purpose we introduce the GeoFormer, a novel architecture which presents a remedy to the said challenges, learning to generate multipolygons end-to-end. By modeling keypoints as spatially dependent tokens in an auto-regressive manner, the GeoFormer outperforms existing works in delineating building objects from satellite imagery. We evaluate the robustness of the GeoFormer against former methods through a variety of parameter ablations and highlight the advantages of optimizing a single likelihood function. Our study presents the first successful application of auto-regressive transformer models for multi-polygon predictions in remote sensing, suggesting a promising methodological alternative for building vectorization.

* 21 pages, 5 figures, in proceedings of British Machine Vision Conference 2024

Via

Access Paper or Ask Questions

EB-NeRD: A Large-Scale Dataset for News Recommendation

Oct 04, 2024

Johannes Kruse, Kasper Lindskow, Saikishore Kalloori, Marco Polignano, Claudio Pomo, Abhishek Srivastava, Anshuk Uppal, Michael Riis Andersen, Jes Frellsen

Figure 1 for EB-NeRD: A Large-Scale Dataset for News Recommendation

Figure 2 for EB-NeRD: A Large-Scale Dataset for News Recommendation

Figure 3 for EB-NeRD: A Large-Scale Dataset for News Recommendation

Figure 4 for EB-NeRD: A Large-Scale Dataset for News Recommendation

Abstract:Personalized content recommendations have been pivotal to the content experience in digital media from video streaming to social networks. However, several domain specific challenges have held back adoption of recommender systems in news publishing. To address these challenges, we introduce the Ekstra Bladet News Recommendation Dataset (EB-NeRD). The dataset encompasses data from over a million unique users and more than 37 million impression logs from Ekstra Bladet. It also includes a collection of over 125,000 Danish news articles, complete with titles, abstracts, bodies, and metadata, such as categories. EB-NeRD served as the benchmark dataset for the RecSys '24 Challenge, where it was demonstrated how the dataset can be used to address both technical and normative challenges in designing effective and responsible recommender systems for news publishing. The dataset is available at: https://recsys.eb.dk.

* 11 pages, 8 tables, 2 figures, RecSys '24

Via

Access Paper or Ask Questions

RecSys Challenge 2024: Balancing Accuracy and Editorial Values in News Recommendations

Sep 30, 2024

Johannes Kruse, Kasper Lindskow, Saikishore Kalloori, Marco Polignano, Claudio Pomo, Abhishek Srivastava, Anshuk Uppal, Michael Riis Andersen, Jes Frellsen

Figure 1 for RecSys Challenge 2024: Balancing Accuracy and Editorial Values in News Recommendations

Figure 2 for RecSys Challenge 2024: Balancing Accuracy and Editorial Values in News Recommendations

Figure 3 for RecSys Challenge 2024: Balancing Accuracy and Editorial Values in News Recommendations

Abstract:The RecSys Challenge 2024 aims to advance news recommendation by addressing both the technical and normative challenges inherent in designing effective and responsible recommender systems for news publishing. This paper describes the challenge, including its objectives, problem setting, and the dataset provided by the Danish news publishers Ekstra Bladet and JP/Politikens Media Group ("Ekstra Bladet"). The challenge explores the unique aspects of news recommendation, such as modeling user preferences based on behavior, accounting for the influence of the news agenda on user interests, and managing the rapid decay of news items. Additionally, the challenge embraces normative complexities, investigating the effects of recommender systems on news flow and their alignment with editorial values. We summarize the challenge setup, dataset characteristics, and evaluation metrics. Finally, we announce the winners and highlight their contributions. The dataset is available at: https://recsys.eb.dk.

* 5 pages, 3 tables, RecSys' 24

Via

Access Paper or Ask Questions

Variance reduction of diffusion model's gradients with Taylor approximation-based control variate

Aug 22, 2024

Paul Jeha, Will Grathwohl, Michael Riis Andersen, Carl Henrik Ek, Jes Frellsen

Figure 1 for Variance reduction of diffusion model's gradients with Taylor approximation-based control variate

Figure 2 for Variance reduction of diffusion model's gradients with Taylor approximation-based control variate

Figure 3 for Variance reduction of diffusion model's gradients with Taylor approximation-based control variate

Figure 4 for Variance reduction of diffusion model's gradients with Taylor approximation-based control variate

Abstract:Score-based models, trained with denoising score matching, are remarkably effective in generating high dimensional data. However, the high variance of their training objective hinders optimisation. We attempt to reduce it with a control variate, derived via a $k$-th order Taylor expansion on the training objective and its gradient. We prove an equivalence between the two and demonstrate empirically the effectiveness of our approach on a low dimensional problem setting; and study its effect on larger problems.

* 14 pages, ICML Structured Probabilistic Inference & Generative Modeling 2024

Via

Access Paper or Ask Questions

Neural machine translation for automated feedback on children's early-stage writing

Nov 15, 2023

Jonas Vestergaard Jensen, Mikkel Jordahn, Michael Riis Andersen

Figure 1 for Neural machine translation for automated feedback on children's early-stage writing

Figure 2 for Neural machine translation for automated feedback on children's early-stage writing

Abstract:In this work, we address the problem of assessing and constructing feedback for early-stage writing automatically using machine learning. Early-stage writing is typically vastly different from conventional writing due to phonetic spelling and lack of proper grammar, punctuation, spacing etc. Consequently, early-stage writing is highly non-trivial to analyze using common linguistic metrics. We propose to use sequence-to-sequence models for "translating" early-stage writing by students into "conventional" writing, which allows the translated text to be analyzed using linguistic metrics. Furthermore, we propose a novel robust likelihood to mitigate the effect of noise in the dataset. We investigate the proposed methods using a set of numerical experiments and demonstrate that the conventional text can be predicted with high accuracy.

* 9 pages, 1 figure, 1 table, to be published in the proceedings of the Northern Lights Deep Learning Conference 2024

Via

Access Paper or Ask Questions

Polygonizer: An auto-regressive building delineator

Apr 08, 2023

Maxim Khomiakov, Michael Riis Andersen, Jes Frellsen

Figure 1 for Polygonizer: An auto-regressive building delineator

Figure 2 for Polygonizer: An auto-regressive building delineator

Figure 3 for Polygonizer: An auto-regressive building delineator

Figure 4 for Polygonizer: An auto-regressive building delineator

Abstract:In geospatial planning, it is often essential to represent objects in a vectorized format, as this format easily translates to downstream tasks such as web development, graphics, or design. While these problems are frequently addressed using semantic segmentation, which requires additional post-processing to vectorize objects in a non-trivial way, we present an Image-to-Sequence model that allows for direct shape inference and is ready for vector-based workflows out of the box. We demonstrate the model's performance in various ways, including perturbations to the image input that correspond to variations or artifacts commonly encountered in remote sensing applications. Our model outperforms prior works when using ground truth bounding boxes (one object per image), achieving the lowest maximum tangent angle error.

* ICLR 2023 Workshop on Machine Learning in Remote Sensing

Via

Access Paper or Ask Questions

Learning to Generate 3D Representations of Building Roofs Using Single-View Aerial Imagery

Mar 20, 2023

Maxim Khomiakov, Alejandro Valverde Mahou, Alba Reinders Sánchez, Jes Frellsen, Michael Riis Andersen

Figure 1 for Learning to Generate 3D Representations of Building Roofs Using Single-View Aerial Imagery

Figure 2 for Learning to Generate 3D Representations of Building Roofs Using Single-View Aerial Imagery

Figure 3 for Learning to Generate 3D Representations of Building Roofs Using Single-View Aerial Imagery

Figure 4 for Learning to Generate 3D Representations of Building Roofs Using Single-View Aerial Imagery

Abstract:We present a novel pipeline for learning the conditional distribution of a building roof mesh given pixels from an aerial image, under the assumption that roof geometry follows a set of regular patterns. Unlike alternative methods that require multiple images of the same object, our approach enables estimating 3D roof meshes using only a single image for predictions. The approach employs the PolyGen, a deep generative transformer architecture for 3D meshes. We apply this model in a new domain and investigate the sensitivity of the image resolution. We propose a novel metric to evaluate the performance of the inferred meshes, and our results show that the model is robust even at lower resolutions, while qualitatively producing realistic representations for out-of-distribution samples.

* Copyright 2023 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works

Via

Access Paper or Ask Questions