☆ 4.5 Article

Estimating extremely large amounts of missing precipitation data

JOURNAL OF HYDROINFORMATICS (2020)

期刊

JOURNAL OF HYDROINFORMATICS

卷 22, 期 3, 页码 578-592

出版社

IWA PUBLISHING

DOI: 10.2166/hydro.2020.127

关键词

evaluation; large missing precipitation; multiple imputation; random forest; spatio-temporal kriging

类别

Computer Science, Interdisciplinary Applications Engineering, Civil Environmental Sciences Water Resources

资金

CLIGRO Project (MICINN) of the Spanish National Plan for Scientific and Technical Research and Innovation [CGL2016-77473-C3-1-R]
National System of Youth Guarantee [PEJ-2014-85121]
Youth Employment Initiative (YEI)
European Social Fund (ESF)
Ministry of Education, Youth and Sport of the Community of Madrid [IND2017/AMB7789]

向作者/读者索取更多资源

Protocol

社区支持

Reagent

社区支持

摘要

Accurate estimation of missing daily precipitation data remains a difficult task. A wide variety of methods exists for infilling missing values, but the percentage of gaps is one of the main factors limiting their applicability. The present study compares three techniques for filling in large amounts of missing daily precipitation data: spatio-temporal kriging (STK), multiple imputation by chained equations through predictive mean matching (PMM), and the random forest (RF) machine learning algorithm. To our knowledge, this is the first time that extreme missingness (>90%) has been considered. Different percentages of missing data and missing patterns are tested in a large dataset drawn from 112 rain gauges in the period 1975-2017. The results show that both STK and RF can handle extreme missingness, while PMM requires larger observed sample sizes. STK is the most robust method, suitable for chronological missing patterns. RF is efficient under random missing patterns. Model evaluation is usually based on performance and error measures. However, this study outlines the risk of just relying on these measures without checking for consistency. The RF algorithm overestimated daily precipitation outside the validation period in some cases due to the overdetection of rainy days under time-dependent missing patterns.

Estimating extremely large amounts of missing precipitation data

期刊

JOURNAL OF HYDROINFORMATICS

出版社

IWA PUBLISHING

关键词

类别

资金

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

Estimating extremely large amounts of missing precipitation data

期刊

JOURNAL OF HYDROINFORMATICS

出版社

IWA PUBLISHING

关键词

类别

资金

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

导出引文

分享论文