Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Joseph Guinness

A Framework for Nonstationary Gaussian Processes with Neural Network Parameters

Jul 16, 2025

Zachary James, Joseph Guinness

Abstract:Gaussian processes have become a popular tool for nonparametric regression because of their flexibility and uncertainty quantification. However, they often use stationary kernels, which limit the expressiveness of the model and may be unsuitable for many datasets. We propose a framework that uses nonstationary kernels whose parameters vary across the feature space, modeling these parameters as the output of a neural network that takes the features as input. The neural network and Gaussian process are trained jointly using the chain rule to calculate derivatives. Our method clearly describes the behavior of the nonstationary parameters and is compatible with approximation methods for scaling to large datasets. It is flexible and easily adapts to different nonstationary kernels without needing to redesign the optimization procedure. Our methods are implemented with the GPyTorch library and can be readily modified. We test a nonstationary variance and noise variant of our method on several machine learning datasets and find that it achieves better accuracy and log-score than both a stationary model and a hierarchical model approximated with variational inference. Similar results are observed for a model with only nonstationary variance. We also demonstrate our approach's ability to recover the nonstationary parameters of a spatial dataset.

Via

Access Paper or Ask Questions

Implementation and Analysis of GPU Algorithms for Vecchia Approximation

Jul 03, 2024

Zachary James, Joseph Guinness

Figure 1 for Implementation and Analysis of GPU Algorithms for Vecchia Approximation

Figure 2 for Implementation and Analysis of GPU Algorithms for Vecchia Approximation

Figure 3 for Implementation and Analysis of GPU Algorithms for Vecchia Approximation

Figure 4 for Implementation and Analysis of GPU Algorithms for Vecchia Approximation

Abstract:Gaussian Processes have become an indispensable part of the spatial statistician's toolbox but are unsuitable for analyzing large dataset because of the significant time and memory needed to fit the associated model exactly. Vecchia Approximation is widely used to reduce the computational complexity and can be calculated with embarrassingly parallel algorithms. While multi-core software has been developed for Vecchia Approximation, such as the GpGp R package, software designed to run on graphics processing units (GPU) is lacking, despite the tremendous success GPUs have had in statistics and machine learning. We compare three different ways to implement Vecchia Approximation on a GPU: two of which are similar to methods used for other Gaussian Process approximations and one that is new. The impact of memory type on performance is investigated and the final method is optimized accordingly. We show that our new method outperforms the other two and then present it in the GpGpU R package. We compare GpGpU to existing multi-core and GPU-accelerated software by fitting Gaussian Process models on various datasets, including a large spatial-temporal dataset of $n>10^6$ points collected from an earth-observing satellite. Our results show that GpGpU achieves faster runtimes and better predictive accuracy.

Via

Access Paper or Ask Questions

Scalable Gaussian-process regression and variable selection using Vecchia approximations

Mar 02, 2022

Jian Cao, Joseph Guinness, Marc G. Genton, Matthias Katzfuss

Figure 1 for Scalable Gaussian-process regression and variable selection using Vecchia approximations

Figure 2 for Scalable Gaussian-process regression and variable selection using Vecchia approximations

Figure 3 for Scalable Gaussian-process regression and variable selection using Vecchia approximations

Figure 4 for Scalable Gaussian-process regression and variable selection using Vecchia approximations

Abstract:Gaussian process (GP) regression is a flexible, nonparametric approach to regression that naturally quantifies uncertainty. In many applications, the number of responses and covariates are both large, and a goal is to select covariates that are related to the response. For this setting, we propose a novel, scalable algorithm, coined VGPR, which optimizes a penalized GP log-likelihood based on the Vecchia GP approximation, an ordered conditional approximation from spatial statistics that implies a sparse Cholesky factor of the precision matrix. We traverse the regularization path from strong to weak penalization, sequentially adding candidate covariates based on the gradient of the log-likelihood and deselecting irrelevant covariates via a new quadratic constrained coordinate descent algorithm. We propose Vecchia-based mini-batch subsampling, which provides unbiased gradient estimators. The resulting procedure is scalable to millions of responses and thousands of covariates. Theoretical analysis and numerical studies demonstrate the improved scalability and accuracy relative to existing methods.

* 28 pages, 8 figures

Via

Access Paper or Ask Questions

Scaled Vecchia approximation for fast computer-model emulation

May 29, 2020

Matthias Katzfuss, Joseph Guinness, Earl Lawrence

Figure 1 for Scaled Vecchia approximation for fast computer-model emulation

Figure 2 for Scaled Vecchia approximation for fast computer-model emulation

Figure 3 for Scaled Vecchia approximation for fast computer-model emulation

Figure 4 for Scaled Vecchia approximation for fast computer-model emulation

Abstract:Many scientific phenomena are studied using computer experiments consisting of multiple runs of a computer model while varying the input settings. Gaussian processes (GPs) are a popular tool for the analysis of computer experiments, enabling interpolation between input settings, but direct GP inference is computationally infeasible for large datasets. We adapt and extend a powerful class of GP methods from spatial statistics to enable the scalable analysis and emulation of large computer experiments. Specifically, we apply Vecchia's ordered conditional approximation in a transformed input space, with each input scaled according to how strongly it relates to the computer-model response. The scaling is learned from the data, by estimating parameters in the GP covariance function using Fisher scoring. Our methods are highly scalable, enabling estimation, joint prediction and simulation in near-linear time in the number of model runs. In several numerical examples, our approach substantially outperformed existing methods.

* R code available at https://github.com/katzfuss-group/scaledVecchia

Via

Access Paper or Ask Questions

Inverses of Matern Covariances on Grids

Dec 26, 2019

Joseph Guinness

Figure 1 for Inverses of Matern Covariances on Grids

Figure 2 for Inverses of Matern Covariances on Grids

Figure 3 for Inverses of Matern Covariances on Grids

Figure 4 for Inverses of Matern Covariances on Grids

Abstract:We conduct a theoretical and numerical study of the aliased spectral densities and inverse operators of Mat\'ern covariance functions on regular grids. We apply our results to provide clarity on the properties of a popular approximation based on stochastic partial differential equations; we find that it can approximate the aliased spectral density and the covariance operator well as the grid spacing goes to zero, but it does not provide increasingly accurate approximations to the inverse operator as the grid spacing goes to zero. If a sparse approximation to the inverse is desired, we suggest instead to select a KL-divergence-minimizing sparse approximation and demonstrate in simulations that these sparse approximations deliver accurate Mat\'ern parameter estimates, while the SPDE approximation over-estimates spatial dependence.

Via

Access Paper or Ask Questions

Gaussian Process Learning via Fisher Scoring of Vecchia's Approximation

May 20, 2019

Joseph Guinness

Figure 1 for Gaussian Process Learning via Fisher Scoring of Vecchia's Approximation

Figure 2 for Gaussian Process Learning via Fisher Scoring of Vecchia's Approximation

Figure 3 for Gaussian Process Learning via Fisher Scoring of Vecchia's Approximation

Abstract:We derive a single pass algorithm for computing the gradient and Fisher information of Vecchia's Gaussian process loglikelihood approximation, which provides a computationally efficient means for applying the Fisher scoring algorithm for maximizing the loglikelihood. The advantages of the optimization techniques are demonstrated in numerical examples and in an application to Argo ocean temperature data. The new methods are more accurate and much faster than an optimization method that uses only function evaluations, especially when the covariance function has many parameters. This allows practitioners to fit nonstationary models to large spatial and spatial-temporal datasets.

Via

Access Paper or Ask Questions