4.6 Article

Compression and Conditional Emulation of Climate Model Output

Journal

JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION
Volume 113, Issue 521, Pages 56-67

Publisher

AMER STATISTICAL ASSOC
DOI: 10.1080/01621459.2017.1395339

Keywords

Gaussian process; Half-spectral; Nonstationary; Spatial-temporal data; SPDE

Funding

  1. Yellowstone by NCAR's Computational and Information Systems Laboratory - National Science Foundation [ark:/85065/d7wd3xhc]
  2. NSF Research Network on Statistics in the Atmosphere and Ocean Sciences (STATMOS) [DMS-1106862]
  3. National Science Foundation [1406016, 1613219]
  4. Direct For Mathematical & Physical Scien
  5. Division Of Mathematical Sciences [1613219] Funding Source: National Science Foundation
  6. Direct For Mathematical & Physical Scien
  7. Division Of Mathematical Sciences [1406016] Funding Source: National Science Foundation

Ask authors/readers for more resources

Numerical climate model simulations run at high spatial and temporal resolutions generate massive quantities of data. As our computing capabilities continue to increase, storing all of the data is not sustainable, and thus it is important to develop methods for representing the full datasets by smaller compressed versions. We propose a statistical compression and decompression algorithm based on storing a set of summary statistics as well as a statistical model describing the conditional distribution of the full dataset given the summary statistics. We decompress the data by computing conditional expectations and conditional simulations from the model given the summary statistics. Conditional expectations represent our best estimate of the original data but are subject to oversmoothing in space and time. Conditional simulations introduce realistic small-scale noise so that the decompressed fields are neither too smooth nor too rough compared with the original data. Considerable attention is paid to accurately modeling the original dataset1 year of daily mean temperature dataparticularly with regard to the inherent spatial nonstationarity in global fields, and to determining the statistics to be stored, so that the variation in the original data can be closely captured, while allowing for fast decompression and conditional emulation on modest computers. Supplementary materials for this article are available online.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.6
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available