Jonathan P. Williams

Conflict Forecasting via Conformal Prediction for Markov Processes

Apr 28, 2026

Aditya Basarkar, Emmett B. Kendall, David Randahl, Jonathan P. Williams, Gudmund H. Hermansen

Abstract:Whether or not a country is at war, or experiencing escalating or deescalating levels of conflict, has massive ramifications for a country's national and foreign policy. Given a country's history of conflict, or lack thereof, future predictions about the war-status of a country are valuable information. In this paper, we present the use of conformal prediction on temporally-dependent data to obtain prediction sets of possible future conflict state-sequences. More specifically, we compare the results of conformal prediction to a likelihood-based prediction strategy when the data are assumed to come from a discrete-state Markov process. A point prediction may not supply sufficient information because the penalty for a wrong prediction is extreme, and so we consider a machine learning alternative that gives valid uncertainty quantification and is robust to model misspecification. In the data analysis, we present real forecasts of conflict dynamics across multiple countries. Lastly, we comment on the possible limitations of existing approaches for applying conformal prediction to Markovian data, where the exchangeability assumption is violated.

Identification of physiological shock in intensive care units via Bayesian regime switching models

Mar 23, 2026

Emmett B. Kendall, Jonathan P. Williams, Curtis B. Storlie, Misty A. Radosevich, Erica D. Wittwer, Matthew A. Warner

Abstract:Detection of occult hemorrhage (i.e., internal bleeding) in patients in intensive care units (ICUs) can pose significant challenges for critical care workers. Because blood loss may not always be clinically apparent, clinicians rely on monitoring vital signs for specific trends indicative of a hemorrhage event. The inherent difficulties of diagnosing such an event can lead to late intervention by clinicians which has catastrophic consequences. Therefore, a methodology for early detection of hemorrhage has wide utility. We develop a Bayesian regime switching model (RSM) that analyzes trends in patients' vitals and labs to provide a probabilistic assessment of the underlying physiological state that a patient is in at any given time. This article is motivated by a comprehensive dataset we curated from Mayo Clinic of 33,924 real ICU patient encounters. Longitudinal response measurements are modeled as a vector autoregressive process conditional on all latent states up to the current time point, and the latent states follow a Markov process. We present a novel Bayesian sampling routine to learn the posterior probability distribution of the latent physiological states, as well as develop an approach to account for pre-ICU-admission physiological changes. A simulation and real case study illustrate the effectiveness of our approach.

Multivariate and Online Transfer Learning with Uncertainty Quantification

Nov 19, 2024

Jimmy Hickey, Jonathan P. Williams, Brian J. Reich, Emily C. Hector

Abstract:Untreated periodontitis causes inflammation within the supporting tissue of the teeth and can ultimately lead to tooth loss. Modeling periodontal outcomes is beneficial as they are difficult and time-consuming to measure, but disparities in representation between demographic groups must be considered. There may not be enough participants to build group-specific models and it can be ineffective, and even dangerous, to apply a model to participants in an underrepresented group if demographic differences were not considered during training. We propose an extension to the RECaST Bayesian transfer learning framework. Our method jointly models multivariate outcomes, exhibiting significant improvement over the previous univariate RECaST method. Further, we introduce an online approach to model sequential data sets. Negative transfer is mitigated to ensure that the information shared from the other demographic groups does not negatively impact the modeling of the underrepresented participants. The Bayesian framework naturally provides uncertainty quantification on predictions. Our method does not share data between domains, which is especially important in medical applications. We demonstrate the effectiveness of our method in both predictive performance and uncertainty quantification on simulated data and on a database of dental records from the HealthPartners Institute.

* 30 pages, 1 figure, 17 tables

Bin-Conditional Conformal Prediction of Fatalities from Armed Conflict

Oct 18, 2024

David Randahl, Jonathan P. Williams, Håvard Hegre

Abstract:Forecasting of armed conflicts is an important area of research that has the potential to save lives and prevent suffering. However, most existing forecasting models provide only point predictions without any individual-level uncertainty estimates. In this paper, we introduce a novel extension to the conformal prediction algorithm, which we call bin-conditional conformal prediction. This method allows users to obtain individual-level prediction intervals for any arbitrary prediction model while maintaining a specific level of coverage across user-defined ranges of values. We apply the bin-conditional conformal prediction algorithm to forecast fatalities from armed conflict. Our results demonstrate that the method provides well-calibrated uncertainty estimates for the predicted number of fatalities. Compared to standard conformal prediction, the bin-conditional method offers improved calibration of coverage rates across different values of the outcome, but at the cost of wider prediction intervals.
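
The idea of keeping coverage within user-defined ranges of the outcome can be illustrated with a small sketch. This is not the authors' algorithm: it is a generic Mondrian-style (outcome-binned) split conformal procedure, assuming absolute-residual scores, a held-out calibration set, and hypothetical function names (`fit_bin_quantiles`, `predict_set`) chosen here for illustration. Each bin gets its own calibration quantile, and the prediction set for a new point is the union of the per-bin intervals clipped to their bins.

```python
import numpy as np

def conformal_quantile(scores, alpha):
    """Finite-sample-corrected (1 - alpha) quantile of calibration scores."""
    n = len(scores)
    k = int(np.ceil((n + 1) * (1 - alpha)))
    if k > n:
        return np.inf  # too few calibration points in this bin
    return np.sort(scores)[k - 1]

def fit_bin_quantiles(y_cal, pred_cal, bin_edges, alpha):
    """One score quantile per outcome bin (bins defined on the true value)."""
    scores = np.abs(y_cal - pred_cal)
    # Interior edges only, so bin indices run from 0 to n_bins - 1.
    bin_idx = np.digitize(y_cal, bin_edges[1:-1])
    n_bins = len(bin_edges) - 1
    return [conformal_quantile(scores[bin_idx == b], alpha) for b in range(n_bins)]

def predict_set(pred, bin_edges, q):
    """Union of per-bin intervals [pred - q_b, pred + q_b], clipped to bin b."""
    pieces = []
    for b in range(len(q)):
        lo = max(bin_edges[b], pred - q[b])
        hi = min(bin_edges[b + 1], pred + q[b])
        if lo <= hi:
            pieces.append((lo, hi))
    return pieces
```

Because each bin uses its own quantile, bins where the model is less accurate receive wider pieces, which is consistent with the trade-off described in the abstract: better-calibrated coverage across outcome ranges at the cost of wider sets.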

Uncertainty quantification in automated valuation models with locally weighted conformal prediction

Dec 11, 2023

Anders Hjort, Gudmund Horn Hermansen, Johan Pensar, Jonathan P. Williams

Abstract:Non-parametric machine learning models, such as random forests and gradient boosted trees, are frequently used to estimate house prices due to their predictive accuracy, but such methods are often limited in their ability to quantify prediction uncertainty. Conformal Prediction (CP) is a model-agnostic framework for constructing confidence sets around machine learning prediction models with minimal assumptions. However, due to the spatial dependencies observed in house prices, direct application of CP leads to confidence sets that are not calibrated everywhere, i.e., confidence sets that are too large in certain geographical regions and too small in others. We survey various approaches to adjust the CP confidence set to account for this and demonstrate their performance on a data set from the housing market in Oslo, Norway. Our findings indicate that calibrating the confidence sets on a locally weighted version of the non-conformity scores makes the coverage more consistently calibrated in different geographical regions. We also perform a simulation study on synthetically generated sale prices to empirically explore the performance of CP on housing market data under idealized conditions with known data-generating mechanisms.
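
One possible reading of "locally weighted" non-conformity scores is sketched below; the abstract does not specify the weighting scheme, so everything here is an assumption. The sketch weights each calibration score by a Gaussian kernel of its geographic distance to the test location and takes a weighted quantile, with the test point itself contributing a point mass at infinity as in weighted conformal prediction. The names `weighted_quantile` and `local_interval`, the kernel, and the `bandwidth` parameter are all hypothetical.

```python
import numpy as np

def weighted_quantile(scores, weights, level, test_weight=1.0):
    """Smallest score whose cumulative normalized weight reaches `level`.

    The test point contributes `test_weight` at +infinity, so the result is
    np.inf when the calibration weights alone cannot reach `level`.
    """
    order = np.argsort(scores)
    s, w = scores[order], weights[order]
    cum = np.cumsum(w) / (np.sum(w) + test_weight)
    if cum[-1] < level:
        return np.inf
    return s[np.searchsorted(cum, level, side="left")]

def local_interval(pred, loc, cal_locs, cal_scores, alpha, bandwidth):
    """Interval pred +/- q, where q is a distance-weighted score quantile."""
    d2 = np.sum((cal_locs - loc) ** 2, axis=1)
    w = np.exp(-d2 / (2.0 * bandwidth ** 2))  # Gaussian kernel weights
    q = weighted_quantile(cal_scores, w, 1.0 - alpha)
    return pred - q, pred + q
```

With a small bandwidth, the interval width is driven by the residuals of nearby calibration points, so regions where the model errs more receive wider intervals; a very large bandwidth recovers ordinary unweighted split conformal prediction.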

Valid Inference for Machine Learning Model Parameters

Feb 21, 2023

Neil Dey, Jonathan P. Williams

Abstract:The parameters of a machine learning model are typically learned by minimizing a loss function on a set of training data. However, this can come with the risk of overtraining; in order for the model to generalize well, it is of great importance that we are able to find the optimal parameter for the model on the entire population -- not only on the given training sample. In this paper, we construct valid confidence sets for this optimal parameter of a machine learning model, which can be generated using only the training data without any knowledge of the population. We then show that studying the distribution of this confidence set allows us to assign a notion of confidence to arbitrary regions of the parameter space, and we demonstrate that this distribution can be well-approximated using bootstrapping techniques.

* 25 pages, 5 figures

Transfer Learning with Uncertainty Quantification: Random Effect Calibration of Source to Target (RECaST)

Nov 29, 2022

Jimmy Hickey, Jonathan P. Williams, Emily C. Hector

Abstract:Transfer learning uses a data model, trained to make predictions or inferences on data from one population, to make reliable predictions or inferences on data from another population. Most existing transfer learning approaches are based on fine-tuning pre-trained neural network models, and fail to provide crucial uncertainty quantification. We develop a statistical framework for model predictions based on transfer learning, called RECaST. The primary mechanism is a Cauchy random effect that recalibrates a source model to a target population; we mathematically and empirically demonstrate the validity of our RECaST approach for transfer learning between linear models, in the sense that prediction sets will achieve their nominal stated coverage, and we numerically illustrate the method's robustness to asymptotic approximations for nonlinear models. Whereas many existing techniques are built on particular source models, RECaST is agnostic to the choice of source model. For example, our RECaST transfer learning approach can be applied to a continuous or discrete data model with linear or logistic regression, deep neural network architectures, etc. Furthermore, RECaST provides uncertainty quantification for predictions, which is mostly absent in the literature. We examine our method's performance in a simulation study and in an application to real hospital data.

* 26 pages, 2 figures, 4 tables
