Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Noah Hollmann

TabPFN-2.5: Advancing the State of the Art in Tabular Foundation Models

Nov 11, 2025

Léo Grinsztajn, Klemens Flöge, Oscar Key, Felix Birkel, Philipp Jund, Brendan Roof, Benjamin Jäger, Dominik Safaric, Simone Alessi, Adrian Hayler(+16 more)

Figure 1 for TabPFN-2.5: Advancing the State of the Art in Tabular Foundation Models

Figure 2 for TabPFN-2.5: Advancing the State of the Art in Tabular Foundation Models

Figure 3 for TabPFN-2.5: Advancing the State of the Art in Tabular Foundation Models

Figure 4 for TabPFN-2.5: Advancing the State of the Art in Tabular Foundation Models

Abstract:The first tabular foundation model, TabPFN, and its successor TabPFNv2 have impacted tabular AI substantially, with dozens of methods building on it and hundreds of applications across different use cases. This report introduces TabPFN-2.5, the next generation of our tabular foundation model, built for datasets with up to 50,000 data points and 2,000 features, a 20x increase in data cells compared to TabPFNv2. TabPFN-2.5 is now the leading method for the industry standard benchmark TabArena (which contains datasets with up to 100,000 training data points), substantially outperforming tuned tree-based models and matching the accuracy of AutoGluon 1.4, a complex four-hour tuned ensemble that even includes the previous TabPFNv2. Remarkably, default TabPFN-2.5 has a 100% win rate against default XGBoost on small to medium-sized classification datasets (<=10,000 data points, 500 features) and a 87% win rate on larger datasets up to 100K samples and 2K features (85% for regression). For production use cases, we introduce a new distillation engine that converts TabPFN-2.5 into a compact MLP or tree ensemble, preserving most of its accuracy while delivering orders-of-magnitude lower latency and plug-and-play deployment. This new release will immediately strengthen the performance of the many applications and methods already built on the TabPFN ecosystem.

Via

Access Paper or Ask Questions

FairPFN: A Tabular Foundation Model for Causal Fairness

Jun 08, 2025

Jake Robertson, Noah Hollmann, Samuel Müller, Noor Awad, Frank Hutter

Abstract:Machine learning (ML) systems are utilized in critical sectors, such as healthcare, law enforcement, and finance. However, these systems are often trained on historical data that contains demographic biases, leading to ML decisions that perpetuate or exacerbate existing social inequalities. Causal fairness provides a transparent, human-in-the-loop framework to mitigate algorithmic discrimination, aligning closely with legal doctrines of direct and indirect discrimination. However, current causal fairness frameworks hold a key limitation in that they assume prior knowledge of the correct causal model, restricting their applicability in complex fairness scenarios where causal models are unknown or difficult to identify. To bridge this gap, we propose FairPFN, a tabular foundation model pre-trained on synthetic causal fairness data to identify and mitigate the causal effects of protected attributes in its predictions. FairPFN's key contribution is that it requires no knowledge of the causal model and still demonstrates strong performance in identifying and removing protected causal effects across a diverse set of hand-crafted and real-world scenarios relative to robust baseline methods. FairPFN paves the way for promising future research, making causal fairness more accessible to a wider variety of complex fairness problems.

Via

Access Paper or Ask Questions

Do-PFN: In-Context Learning for Causal Effect Estimation

Jun 06, 2025

Jake Robertson, Arik Reuter, Siyuan Guo, Noah Hollmann, Frank Hutter, Bernhard Schölkopf

Abstract:Estimation of causal effects is critical to a range of scientific disciplines. Existing methods for this task either require interventional data, knowledge about the ground truth causal graph, or rely on assumptions such as unconfoundedness, restricting their applicability in real-world settings. In the domain of tabular machine learning, Prior-data fitted networks (PFNs) have achieved state-of-the-art predictive performance, having been pre-trained on synthetic data to solve tabular prediction problems via in-context learning. To assess whether this can be transferred to the harder problem of causal effect estimation, we pre-train PFNs on synthetic data drawn from a wide variety of causal structures, including interventions, to predict interventional outcomes given observational data. Through extensive experiments on synthetic case studies, we show that our approach allows for the accurate estimation of causal effects without knowledge of the underlying causal graph. We also perform ablation studies that elucidate Do-PFN's scalability and robustness across datasets with a variety of causal characteristics.

Via

Access Paper or Ask Questions

Position: The Future of Bayesian Prediction Is Prior-Fitted

May 29, 2025

Samuel Müller, Arik Reuter, Noah Hollmann, David Rügamer, Frank Hutter

Abstract:Training neural networks on randomly generated artificial datasets yields Bayesian models that capture the prior defined by the dataset-generating distribution. Prior-data Fitted Networks (PFNs) are a class of methods designed to leverage this insight. In an era of rapidly increasing computational resources for pre-training and a near stagnation in the generation of new real-world data in many applications, PFNs are poised to play a more important role across a wide range of applications. They enable the efficient allocation of pre-training compute to low-data scenarios. Originally applied to small Bayesian modeling tasks, the field of PFNs has significantly expanded to address more complex domains and larger datasets. This position paper argues that PFNs and other amortized inference approaches represent the future of Bayesian inference, leveraging amortized learning to tackle data-scarce problems. We thus believe they are a fruitful area of research. In this position paper, we explore their potential and directions to address their current limitations.

* Accepted as position paper at ICML 2025

Via

Access Paper or Ask Questions

Drift-Resilient TabPFN: In-Context Learning Temporal Distribution Shifts on Tabular Data

Nov 15, 2024

Kai Helli, David Schnurr, Noah Hollmann, Samuel Müller, Frank Hutter

Figure 1 for Drift-Resilient TabPFN: In-Context Learning Temporal Distribution Shifts on Tabular Data

Figure 2 for Drift-Resilient TabPFN: In-Context Learning Temporal Distribution Shifts on Tabular Data

Figure 3 for Drift-Resilient TabPFN: In-Context Learning Temporal Distribution Shifts on Tabular Data

Figure 4 for Drift-Resilient TabPFN: In-Context Learning Temporal Distribution Shifts on Tabular Data

Abstract:While most ML models expect independent and identically distributed data, this assumption is often violated in real-world scenarios due to distribution shifts, resulting in the degradation of machine learning model performance. Until now, no tabular method has consistently outperformed classical supervised learning, which ignores these shifts. To address temporal distribution shifts, we present Drift-Resilient TabPFN, a fresh approach based on In-Context Learning with a Prior-Data Fitted Network that learns the learning algorithm itself: it accepts the entire training dataset as input and makes predictions on the test set in a single forward pass. Specifically, it learns to approximate Bayesian inference on synthetic datasets drawn from a prior that specifies the model's inductive bias. This prior is based on structural causal models (SCM), which gradually shift over time. To model shifts of these causal models, we use a secondary SCM, that specifies changes in the primary model parameters. The resulting Drift-Resilient TabPFN can be applied to unseen data, runs in seconds on small to moderately sized datasets and needs no hyperparameter tuning. Comprehensive evaluations across 18 synthetic and real-world datasets demonstrate large performance improvements over a wide range of baselines, such as XGB, CatBoost, TabPFN, and applicable methods featured in the Wild-Time benchmark. Compared to the strongest baselines, it improves accuracy from 0.688 to 0.744 and ROC AUC from 0.786 to 0.832 while maintaining stronger calibration. This approach could serve as significant groundwork for further research on out-of-distribution prediction.

* Accepted at the 38th Conference on Neural Information Processing Systems (NeurIPS 2024)

Via

Access Paper or Ask Questions

Bayes' Power for Explaining In-Context Learning Generalizations

Oct 02, 2024

Samuel Müller, Noah Hollmann, Frank Hutter

Figure 1 for Bayes' Power for Explaining In-Context Learning Generalizations

Figure 2 for Bayes' Power for Explaining In-Context Learning Generalizations

Figure 3 for Bayes' Power for Explaining In-Context Learning Generalizations

Figure 4 for Bayes' Power for Explaining In-Context Learning Generalizations

Abstract:Traditionally, neural network training has been primarily viewed as an approximation of maximum likelihood estimation (MLE). This interpretation originated in a time when training for multiple epochs on small datasets was common and performance was data bound; but it falls short in the era of large-scale single-epoch trainings ushered in by large self-supervised setups, like language models. In this new setup, performance is compute-bound, but data is readily available. As models became more powerful, in-context learning (ICL), i.e., learning in a single forward-pass based on the context, emerged as one of the dominant paradigms. In this paper, we argue that a more useful interpretation of neural network behavior in this era is as an approximation of the true posterior, as defined by the data-generating process. We demonstrate this interpretations' power for ICL and its usefulness to predict generalizations to previously unseen tasks. We show how models become robust in-context learners by effectively composing knowledge from their training data. We illustrate this with experiments that reveal surprising generalizations, all explicable through the exact posterior. Finally, we show the inherent constraints of the generalization capabilities of posteriors and the limitations of neural networks in approximating these posteriors.

Via

Access Paper or Ask Questions

FairPFN: Transformers Can do Counterfactual Fairness

Jul 08, 2024

Jake Robertson, Noah Hollmann, Noor Awad, Frank Hutter

Figure 1 for FairPFN: Transformers Can do Counterfactual Fairness

Figure 2 for FairPFN: Transformers Can do Counterfactual Fairness

Figure 3 for FairPFN: Transformers Can do Counterfactual Fairness

Figure 4 for FairPFN: Transformers Can do Counterfactual Fairness

Abstract:Machine Learning systems are increasingly prevalent across healthcare, law enforcement, and finance but often operate on historical data, which may carry biases against certain demographic groups. Causal and counterfactual fairness provides an intuitive way to define fairness that closely aligns with legal standards. Despite its theoretical benefits, counterfactual fairness comes with several practical limitations, largely related to the reliance on domain knowledge and approximate causal discovery techniques in constructing a causal model. In this study, we take a fresh perspective on counterfactually fair prediction, building upon recent work in in context learning (ICL) and prior fitted networks (PFNs) to learn a transformer called FairPFN. This model is pretrained using synthetic fairness data to eliminate the causal effects of protected attributes directly from observational data, removing the requirement of access to the correct causal model in practice. In our experiments, we thoroughly assess the effectiveness of FairPFN in eliminating the causal impact of protected attributes on a series of synthetic case studies and real world datasets. Our findings pave the way for a new and promising research area: transformers for causal and counterfactual fairness.

Via

Access Paper or Ask Questions

PFNs4BO: In-Context Learning for Bayesian Optimization

Jun 09, 2023

Samuel Müller, Matthias Feurer, Noah Hollmann, Frank Hutter

Figure 1 for PFNs4BO: In-Context Learning for Bayesian Optimization

Figure 2 for PFNs4BO: In-Context Learning for Bayesian Optimization

Figure 3 for PFNs4BO: In-Context Learning for Bayesian Optimization

Figure 4 for PFNs4BO: In-Context Learning for Bayesian Optimization

Abstract:In this paper, we use Prior-data Fitted Networks (PFNs) as a flexible surrogate for Bayesian Optimization (BO). PFNs are neural processes that are trained to approximate the posterior predictive distribution (PPD) through in-context learning on any prior distribution that can be efficiently sampled from. We describe how this flexibility can be exploited for surrogate modeling in BO. We use PFNs to mimic a naive Gaussian process (GP), an advanced GP, and a Bayesian Neural Network (BNN). In addition, we show how to incorporate further information into the prior, such as allowing hints about the position of optima (user priors), ignoring irrelevant dimensions, and performing non-myopic BO by learning the acquisition function. The flexibility underlying these extensions opens up vast possibilities for using PFNs for BO. We demonstrate the usefulness of PFNs for BO in a large-scale evaluation on artificial GP samples and three different hyperparameter optimization testbeds: HPO-B, Bayesmark, and PD1. We publish code alongside trained models at https://github.com/automl/PFNs4BO.

* Accepted at ICML 2023

Via

Access Paper or Ask Questions

GPT for Semi-Automated Data Science: Introducing CAAFE for Context-Aware Automated Feature Engineering

May 05, 2023

Noah Hollmann, Samuel Müller, Frank Hutter

Figure 1 for GPT for Semi-Automated Data Science: Introducing CAAFE for Context-Aware Automated Feature Engineering

Figure 2 for GPT for Semi-Automated Data Science: Introducing CAAFE for Context-Aware Automated Feature Engineering

Figure 3 for GPT for Semi-Automated Data Science: Introducing CAAFE for Context-Aware Automated Feature Engineering

Figure 4 for GPT for Semi-Automated Data Science: Introducing CAAFE for Context-Aware Automated Feature Engineering

Abstract:As the field of automated machine learning (AutoML) advances, it becomes increasingly important to include domain knowledge within these systems. We present an approach for doing so by harnessing the power of large language models (LLMs). Specifically, we introduce Context-Aware Automated Feature Engineering (CAAFE), a feature engineering method for tabular datasets that utilizes an LLM to generate additional semantically meaningful features for tabular datasets based on their descriptions. The method produces both Python code for creating new features and explanations for the utility of the generated features. Despite being methodologically simple, CAAFE enhances performance on 11 out of 14 datasets, ties on 2 and looses on 1 - boosting mean ROC AUC performance from 0.798 to 0.822 across all datasets. On the evaluated datasets, this improvement is similar to the average improvement achieved by using a random forest (AUC 0.782) instead of logistic regression (AUC 0.754). Furthermore, our method offers valuable insights into the rationale behind the generated features by providing a textual explanation for each generated feature. CAAFE paves the way for more extensive (semi-)automation in data science tasks and emphasizes the significance of context-aware solutions that can extend the scope of AutoML systems. For reproducability, we release our code and a simple demo.

Via

Access Paper or Ask Questions

Meta-Learning a Real-Time Tabular AutoML Method For Small Data

Jul 05, 2022

Noah Hollmann, Samuel Müller, Katharina Eggensperger, Frank Hutter

Figure 1 for Meta-Learning a Real-Time Tabular AutoML Method For Small Data

Figure 2 for Meta-Learning a Real-Time Tabular AutoML Method For Small Data

Figure 3 for Meta-Learning a Real-Time Tabular AutoML Method For Small Data

Figure 4 for Meta-Learning a Real-Time Tabular AutoML Method For Small Data

Abstract:We present TabPFN, an AutoML method that is competitive with the state of the art on small tabular datasets while being over 1,000$\times$ faster. Our method is very simple: it is fully entailed in the weights of a single neural network, and a single forward pass directly yields predictions for a new dataset. Our AutoML method is meta-learned using the Transformer-based Prior-Data Fitted Network (PFN) architecture and approximates Bayesian inference with a prior that is based on assumptions of simplicity and causal structures. The prior contains a large space of structural causal models and Bayesian neural networks with a bias for small architectures and thus low complexity. Furthermore, we extend the PFN approach to differentiably calibrate the prior's hyperparameters on real data. By doing so, we separate our abstract prior assumptions from their heuristic calibration on real data. Afterwards, the calibrated hyperparameters are fixed and TabPFN can be applied to any new tabular dataset at the push of a button. Finally, on 30 datasets from the OpenML-CC18 suite we show that our method outperforms boosted trees and performs on par with complex state-of-the-art AutoML systems with predictions produced in less than a second. We provide all our code and our final trained TabPFN in the supplementary materials.

Via

Access Paper or Ask Questions