4.7 Article

Efficient surrogate modeling methods for large-scale Earth system models based on machine-learning techniques

期刊

GEOSCIENTIFIC MODEL DEVELOPMENT
卷 12, 期 5, 页码 1791-1807

出版社

COPERNICUS GESELLSCHAFT MBH
DOI: 10.5194/gmd-12-1791-2019

关键词

-

资金

  1. Scientific Discovery through Advanced Computing (SciDAC) program - U.S. Department of Energy (DOE), Office of Advanced Scientific Computing Research (ASCR)
  2. Office of Biological and Environmental Research (BER)
  3. BER's Terrestrial Ecosystem Science Scientific Focus Area (TES-SFA) project
  4. Oak Ridge National Laboratory - DOE [DE-AC05-00OR22725]

向作者/读者索取更多资源

Improving predictive understanding of Earth system variability and change requires data-model integration. Efficient data-model integration for complex models requires surrogate modeling to reduce model evaluation time. However, building a surrogate of a large-scale Earth system model (ESM) with many output variables is computationally intensive because it involves a large number of expensive ESM simulations. In this effort, we propose an efficient surrogate method capable of using a few ESM runs to build an accurate and fast-to-evaluate surrogate system of model outputs over large spatial and temporal domains. We first use singular value decomposition to reduce the output dimensions and then use Bayesian optimization techniques to generate an accurate neural network surrogate model based on limited ESM simulation samples. Our machine-learning-based surrogate methods can build and evaluate a large surrogate system of many variables quickly. Thus, whenever the quantities of interest change, such as a different objective function, a new site, and a longer simulation time, we can simply extract the information of interest from the surrogate system without rebuilding new surrogates, which significantly reduces computational efforts. We apply the proposed method to a regional ecosystem model to approximate the relationship between eight model parameters and 42 660 carbon flux outputs. Results indicate that using only 20 model simulations, we can build an accurate surrogate system of the 42 660 variables, wherein the consistency between the surrogate prediction and actual model simulation is 0.93 and the mean squared error is 0.02. This highly accurate and fast-to-evaluate surrogate system will greatly enhance the computational efficiency of data-model integration to improve predictions and advance our understanding of the Earth system.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据