4.7 Article

Toward a Multi-Representational Approach to Prediction and Understanding, in Support of Discovery in Hydrology

期刊

WATER RESOURCES RESEARCH
卷 59, 期 1, 页码 -

出版社

AMER GEOPHYSICAL UNION
DOI: 10.1029/2021WR031548

关键词

representation; machine learning; LSTM; Random Forest; GR4J; conceptual model; lumped water balance model; understanding; discovery; hydrological processes; catchments; hydro-geo-climatology

向作者/读者索取更多资源

The selection of a suitable representational system is key to model development, as it determines the questions that can be asked, the analyses and inferences that can be made, and the answers that can be obtained. This paper explores three representational strategies for understanding catchment scale hydrological processes, and finds that each approach has its own strengths, ultimately supporting improved understanding and prediction.
Key to model development is the selection of an appropriate representational system, including both the representation of what is observed (the data), and the formal mathematical structure used to construct the input-state-output mapping. These choices are critical, because they completely determine the questions we can ask, the nature of the analyses and inferences we can perform, and the answers we can obtain. Accordingly, a representation that is suitable for one kind of investigation might be limited in its ability to support some other kind. Arguably, how different representational approaches affect what we can learn from data is poorly understood. This paper explores three representational strategies as vehicles for understanding how catchment scale hydrological processes vary across hydro-geo-climatologically diverse Chile. Specifically, we test a lumped water-balance model (GR4J), a data-based dynamical systems model (LSTM), and a data-based regression tree model (Random Forest). Insights were obtained regarding system memory encoded in data, spatial transferability by use of surrogate attributes, and informational deficiencies of the data set that limit our ability to learn an adequate input-output relationship. As expected, each approach exhibits specific strengths, with LSTM providing the best characterization of dynamics, GR4J being the most robust under informationally deficient conditions, and Random Forest regression-tree method being most supportive of interpretation. Overall, the contrasting nature of the three approaches suggests the value of adopting a multi-representational framework to more fully extract information from the data and, by doing so, find information that better facilities the goals of robust prediction and improved understanding, ultimately supporting enhanced scientific discovery.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据