🌙

smashGP:基于无矩阵高斯过程的大规模空间建模

smashGP: Large-Scale Spatial Modeling via Matrix-Free Gaussian Processes

Journal of Computational and Graphical Statistics · 2024
被引 1
ABS 3

中文导读

提出smashGP框架,利用层次矩阵和无矩阵方法,在O(n log n)时间内完成大规模空间数据的高斯过程建模与预测,支持并行计算。

Abstract

Gaussian processes are essential for spatial data analysis. Not only do they allow the prediction of unknown values, but they also allow for uncertainty quantification. However, in the era of big data, directly using Gaussian processes has become computationally infeasible as cubic run times are required for dense matrix decomposition and inversion. Various alternatives have been proposed to reduce the computational burden of directly fitting Gaussian processes. These alternatives rely on assumptions on the underlying structure of the covariance or precision matrices, such as sparsity or low-rank. In contrast, this article uses hierarchical matrices and matrix-free methods to enable the computation of Gaussian processes for large spatial datasets by exploiting the underlying kernel properties. The proposed framework, smashGP, represents the covariance matrix as an H2 matrix in O(n) time and is able to estimate the unknown parameters of the model and predict the values of spatial observations at unobserved locations in O(n log n) time thanks to fast matrix-vector products. Additionally, it can be parallelized to take full advantage of shared-memory computing environments. With simulations and case studies, we illustrate the advantage of smashGP to model large-scale spatial datasets. Supplementary materials for this article are available online.

空间数据分析高斯过程大规模计算层次矩阵