4.4 Article

Improving Artificial Neural Network Based Streamflow Forecasting Models through Data Preprocessing

期刊

KSCE JOURNAL OF CIVIL ENGINEERING
卷 25, 期 9, 页码 3583-3595

出版社

KOREAN SOCIETY OF CIVIL ENGINEERS-KSCE
DOI: 10.1007/s12205-021-1859-y

关键词

Hydrological data; Data preprocessing; Box-Cox transformation; ANN models; Gamma statistics

向作者/读者索取更多资源

Real-time hydrological data can be noisy, contain missing information, and deviate from its original scale due to the complex and nonlinear nature of hydrological processes. This study introduces a simple preprocessing approach, involving power transformations, Box-Cox transformations, and input variable selection through the Gamma Test, to improve the performance of ANN-based streamflow estimation models. The results show that the models developed with transformed data-sets outperform those developed with original data in terms of NSE, R-2, and other statistical errors, indicating that simple preprocessing options can significantly reduce uncertainty in ANN-based hydrological models by enhancing the quality of real-time hydrological data.
The real time hydrological data may contain noise, missing information and deviation from its original scale due to complex and nonlinear nature of hydrological processes. The data when used as it is in hydrological forecasting may create uncertainty in hydrological models, especially in data-driven models which fully rely upon the input-output data. The current research provides a simple preprocessing approach to improve the performance of ANN-based streamflow estimation models through providing a better input state. The two-step preprocessing approach includes; the data transformation through a family of power transformation, the Box-Cox transformation, and the selection of appropriate input variables through the Gamma Test. The original data, which is essentially antecedent upland catchment information of thirteen stations located in Upper Indus Basin (UIB), comprises of twenty inputs, including precipitation, solar radiation and discharge. The Box-Cox transformation has been applied to prepare a transformed data-set and the power factor, lambda, (with best value of 0.005), for this transformation, has been determined using probability plots and histogram characteristics. Input combination selection procedure is carried out in WinGamma environment with the help of Genetic Algorithm (GA). Two-layer ANN models have been trained through Broyden, Fletcher and Goldfrab Shano (BFGS) training algorithm for both original and transformed data-sets. The comparison of models clearly indicate that the models developed through transformed data-set showed better performance in both training and testing phases with high values of NSE and R-2 which is above 90% in most of the cases, and less other statistical errors including RMSE, VARIANCE and BIAS. Simple preprocessing options, could significantly reduce the uncertainty in ANN based hydrological models through improving the quality of real time hydrological data.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.4
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据