4.6 Article

Learning from mistakes-Assessing the performance and uncertainty in process-based models

期刊

HYDROLOGICAL PROCESSES
卷 36, 期 2, 页码 -

出版社

WILEY
DOI: 10.1002/hyp.14515

关键词

explainable; machine learning; process-based modelling; stream temperature

资金

  1. Austrian Science Fund [P 31213]
  2. Global Water Futures
  3. Natural Sciences and Engineering Research Council of Canada

向作者/读者索取更多资源

This study presents a method and workflow for analyzing errors in process-based models to improve process understanding and prediction. By using machine learning models and explanatory analysis, model errors can be predicted and grouped, leading to hypotheses on error generation. In a case study of a mountain stream model, clustering analysis resulted in three distinct error groups related to shading and vegetation radiation. These findings enhance trust in implemented processes and provide valuable information for model improvement.
Typical applications of process- or physically-based models aim to gain a better process understanding or provide the basis for a decision-making process. To adequately represent the physical system, models should include all essential processes. However, model errors can still occur. Other than large systematic observation errors, simplified, misrepresented, inadequately parametrised or missing processes are potential sources of errors. This study presents a set of methods and a proposed workflow for analysing errors of process-based models as a basis for relating them to process representations. The evaluated approach consists of three steps: (1) training a machine-learning (ML) error model using the input data of the process-based model and other available variables, (2) estimation of local explanations (i.e., contributions of each variable to an individual prediction) for each predicted model error using SHapley Additive exPlanations (SHAP) in combination with principal component analysis, (3) clustering of SHAP values of all predicted errors to derive groups with similar error generation characteristics. By analysing these groups of different error-variable association, hypotheses on error generation and corresponding processes can be formulated. That can ultimately lead to improvements in process understanding and prediction. The approach is applied to a process-based stream water temperature model HFLUX in a case study for modelling an alpine stream in the Canadian Rocky Mountains. By using available meteorological and hydrological variables as inputs, the applied ML model is able to predict model residuals. Clustering of SHAP values results in three distinct error groups that are mainly related to shading and vegetation-emitted long wave radiation. Model errors are rarely random and often contain valuable information. Assessing model error associations is ultimately a way of enhancing trust in implemented processes and of providing information on potential areas of improvement to the model.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.6
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据