☆ 4.7 Article

Interpretable vs. noninterpretable machine learning models for data-driven hydro-climatological process modeling ®

EXPERT SYSTEMS WITH APPLICATIONS (2021)

Journal

EXPERT SYSTEMS WITH APPLICATIONS

Volume 170, Issue -, Pages -

Publisher

PERGAMON-ELSEVIER SCIENCE LTD

DOI: 10.1016/j.eswa.2020.114498

Keywords

Deep learning; Boosting; Transfer learning; Hydroclimate; Reference crop evapotranspiration; Model explainability

Ask authors/readers for more resources

Protocol

Community support

Reagent

Community support

Automated Summary New
Abstract

This study compared the predictive capabilities of interpretable and noninterpretable machine learning models, revealing that tree-based ensemble models can perform similarly to deep learning models in structured hydro-climatological datasets. Using a newly developed sequential transfer-learning technique, the tree-based ensemble model was able to impute missing climate data at various levels. The eXML framework quantified the global importance of hydro-climatic variables and identified transition points of climate variables for daily ETo rates.

Due to their enhanced predictive capabilities, noninterpretable machine learning (ML) models (e.g. deep learning) have recently gained a growing interest in analyzing and modeling earth & planetary science data. However, noninterpretable ML models are often treated as ?black boxes? by end-users, which could limit their applicability in critical decision making processes. In this paper, we compared the predictive capabilities of three interpretable ML models with three noninterpretable ML models to answer the overarching question: Is it essential to use noninterpretable ML models for enhanced model predictions from hydro-climatological datasets? The ML model development and comparative analysis were performed using measured climate data and synthetic reference crop evapotranspiration (ETo) data, with varying levels of missing values, from five weather stations across the karstic Edwards aquifer region in semi-arid south-central Texas. Our analysis revealed that interpretable tree based ensemble models produce comparable results to noninterpretable deep learning models on structured hydro-climatological datasets. We showed that the tree-based ensemble model is also capable of imputing varying levels of missing climate data at the weather stations, employing the newly developed sequential transfer-learning technique. We applied an explainable machine learning (eXML) framework to quantify the global order of importance of hydro-climatic (predictor) variables on ETo, while highlighting the local dependencies and interactions amongst the predictors and ETo. The eXML framework also revealed the inflection points of the climate variables at which the transition from low to high daily ETo rates occur. The ancillary explainability of ML models are expected to increase users? confidence and support any future decision-making process in water resource management.

Interpretable vs. noninterpretable machine learning models for data-driven hydro-climatological process modeling ®

Journal

EXPERT SYSTEMS WITH APPLICATIONS

Publisher

PERGAMON-ELSEVIER SCIENCE LTD

Keywords

Categories

Ask authors/readers for more resources

Protocol

Reagent

Authors

I am an author on this paper

Reviews

Primary Rating

Secondary Ratings

Novelty

Significance

Scientific rigor

Rate this paper

Recommended

Interpretable vs. noninterpretable machine learning models for data-driven hydro-climatological process modeling ®

Journal

EXPERT SYSTEMS WITH APPLICATIONS

Publisher

PERGAMON-ELSEVIER SCIENCE LTD

Keywords

Categories

Ask authors/readers for more resources

Protocol

Reagent

Authors

I am an author on this paper

Reviews

Primary Rating

Secondary Ratings

Novelty

Significance

Scientific rigor

Rate this paper

Recommended

Export Citation

Share Paper