4.7 Article

Interpretable vs. noninterpretable machine learning models for data-driven hydro-climatological process modeling ®

期刊

EXPERT SYSTEMS WITH APPLICATIONS
卷 170, 期 -, 页码 -

出版社

PERGAMON-ELSEVIER SCIENCE LTD
DOI: 10.1016/j.eswa.2020.114498

关键词

Deep learning; Boosting; Transfer learning; Hydroclimate; Reference crop evapotranspiration; Model explainability

向作者/读者索取更多资源

This study compared the predictive capabilities of interpretable and noninterpretable machine learning models, revealing that tree-based ensemble models can perform similarly to deep learning models in structured hydro-climatological datasets. Using a newly developed sequential transfer-learning technique, the tree-based ensemble model was able to impute missing climate data at various levels. The eXML framework quantified the global importance of hydro-climatic variables and identified transition points of climate variables for daily ETo rates.
Due to their enhanced predictive capabilities, noninterpretable machine learning (ML) models (e.g. deep learning) have recently gained a growing interest in analyzing and modeling earth & planetary science data. However, noninterpretable ML models are often treated as ?black boxes? by end-users, which could limit their applicability in critical decision making processes. In this paper, we compared the predictive capabilities of three interpretable ML models with three noninterpretable ML models to answer the overarching question: Is it essential to use noninterpretable ML models for enhanced model predictions from hydro-climatological datasets? The ML model development and comparative analysis were performed using measured climate data and synthetic reference crop evapotranspiration (ETo) data, with varying levels of missing values, from five weather stations across the karstic Edwards aquifer region in semi-arid south-central Texas. Our analysis revealed that interpretable tree based ensemble models produce comparable results to noninterpretable deep learning models on structured hydro-climatological datasets. We showed that the tree-based ensemble model is also capable of imputing varying levels of missing climate data at the weather stations, employing the newly developed sequential transfer-learning technique. We applied an explainable machine learning (eXML) framework to quantify the global order of importance of hydro-climatic (predictor) variables on ETo, while highlighting the local dependencies and interactions amongst the predictors and ETo. The eXML framework also revealed the inflection points of the climate variables at which the transition from low to high daily ETo rates occur. The ancillary explainability of ML models are expected to increase users? confidence and support any future decision-making process in water resource management.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据