4.6 Article

Highly Scalable Bayesian Geostatistical Modeling via Meshed Gaussian Processes on Partitioned Domains

期刊

JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION
卷 117, 期 538, 页码 969-982

出版社

TAYLOR & FRANCIS INC
DOI: 10.1080/01621459.2020.1833889

关键词

Bayesian; Domain partitioning; Graphical models; Large n; Sparsity; Spatial

资金

  1. NSF [DMS-1513654, IIS-1562303, DMS-1916349]
  2. National Institute of Health [NIEHS-R01ES027027, NIEHS-R01ES030210]
  3. National Science Foundation (NSF) [EF-1253225, DMS-1916395]
  4. National Aeronautics and Space Administration's Carbon Monitoring System project
  5. National Institute of Environmental Health Sciences of the National Institutes of Health [1R01ES028804]
  6. European Union [856506]

向作者/读者索取更多资源

We introduce a class of scalable Bayesian hierarchical models for the analysis of massive geostatistical datasets. The models combine ideas on high-dimensional geostatistics, partitioning the spatial domain, and sparsity-inducing directed acyclic graphs (DAGs). A major contribution is the development of a meshed Gaussian process (MGP) on tessellated domains, along with a Gibbs sampler for efficient recovery of spatial random effects. The cubic MGP (Q-MGP) improves mixing and computing time by executing large-scale operations in parallel within the Gibbs sampler.
We introduce a class of scalable Bayesian hierarchical models for the analysis of massive geostatistical datasets. The underlying idea combines ideas on high-dimensional geostatistics by partitioning the spatial domain and modeling the regions in the partition using a sparsity-inducing directed acyclic graph (DAG). We extend the model over the DAG to a well-defined spatial process, which we call the meshed Gaussian process (MGP). A major contribution is the development of an MGPs on tessellated domains, accompanied by a Gibbs sampler for the efficient recovery of spatial random effects. In particular, the cubic MGP (Q-MGP) can harness high-performance computing resources by executing all large-scale operations in parallel within the Gibbs sampler, improving mixing and computing time compared to sequential updating schemes. Unlike some existing models for large spatial data, a Q-MGP facilitates massive caching of expensive matrix operations, making it particularly apt in dealing with spatiotemporal remote-sensing data. We compare Q-MGPs with large synthetic and real world data against state-of-the-art methods. We also illustrate using Normalized Difference Vegetation Index data from the Serengeti park region to recover latent multivariate spatiotemporal random effects at millions of locations. The source code is available at github.com/mkln/meshgp. for this article are available online.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.6
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据