4.5 Article

Kryging: geostatistical analysis of large-scale datasets using Krylov subspace methods

Journal

STATISTICS AND COMPUTING
Volume 32, Issue 5, Pages -

Publisher

SPRINGER
DOI: 10.1007/s11222-022-10104-3

Keywords

Approximate inference; Profile likelihood; Block Toeplitz matrix; Fast Fourier transform; Krylov subspace methods; Golub-Kahan bidiagonalization

Funding

  1. National Science Foundation [DMS-1845406, DMS-1638521]
  2. National Institute of Health [R01ES031651-01, R01ES027892]
  3. King Abdullah University of Science and Technology [3800.2]

Ask authors/readers for more resources

In this study, we propose a novel approximate inference methodology for analyzing massive spatial datasets using a Gaussian process model. Our method effectively estimates spatial covariance parameters and makes accurate predictions with uncertainty quantification for point-referenced spatial data. It is applicable to various types of observations and covariance functions. Extensive simulation studies and a real data application demonstrate that our method significantly reduces computation time while maintaining scalability.
Analyzing massive spatial datasets using a Gaussian process model poses computational challenges. This is a problem prevailing heavily in applications such as environmental modeling, ecology, forestry and environmental health. We present a novel approximate inference methodology that uses profile likelihood and Krylov subspace methods to estimate the spatial covariance parameters and makes spatial predictions with uncertainty quantification for point-referenced spatial data. Kryging combines Kriging and Krylov subspace methods and applies for both observations on regular grid and irregularly spaced observations, and for any Gaussian process with a stationary isotropic (and certain geometrically anisotropic) covariance function, including the popular Matern covariance family. We make use of the block Toeplitz structure with Toeplitz blocks of the covariance matrix and use fast Fourier transform methods to bypass the computational and memory bottlenecks of approximating log-determinant and matrix-vector products. We perform extensive simulation studies to show the effectiveness of our model by varying sample sizes, spatial parameter values and sampling designs. A real data application is also performed on a dataset consisting of land surface temperature readings taken by the MODIS satellite. Compared to existing methods, the proposed method performs satisfactorily with much less computation time and better scalability.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.5
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available