4.7 Article

Rainfall forecasting model using machine learning methods: Case study Terengganu, Malaysia

期刊

AIN SHAMS ENGINEERING JOURNAL
卷 12, 期 2, 页码 1651-1663

出版社

ELSEVIER
DOI: 10.1016/j.asej.2020.09.011

关键词

Forecasting rainfall; Machine learning algorithms; Boosted decision tree regression; Decision forest regression; Neural network regression; Bayesian linear regression

资金

  1. TNB Seed Fund [U-TG-RD-19-01]

向作者/读者索取更多资源

In this study, various Machine Learning models and methods were used to predict rainfall data in Tasik Kenyir, Terengganu, Malaysia. The results showed that ML models had higher accuracy compared to M2 in different time horizons, with the BDTR model performing the best.
Rainfall plays a main role in managing the water level in the reservoir. The unpredictable amount of rainfall due to the climate change can cause either overflow or dry in the reservoir. In this study, several models and methods were applied to predict the rainfall data in Tasik Kenyir, Terengganu. The comparative study was conducted focusing on developing and comparing several Machine Learning (ML) models, evaluating different scenarios and time horizon, and forecasting rainfall using two types of methods. Data involved for this research consist of taking the average rainfall from 10 stations around the study area using Thiessen polygon to weight the station area and projected rainfall. The forecasting model uses four different ML algorithms, which are Bayesian Linear Regression (BLR), Boosted Decision Tree Regression (BDTR), Decision Forest Regression (DFR) and Neural Network Regression (NNR). On the other hand, the rainfall was predicted on different time horizon by using different ML's algorithms which is method 1 (M1): Forecasting Rainfall Using Autocorrelation Function (ACF) and method 2 (M2): Forecasting Rainfall Using Projected Error. In Ml, the best regression developed for ACF is BDTR since it has the highest coefficient of determination, R2, after tuning the hyperparameter. The results show coefficient between 0.5 and 0.9 with the highest of each scenarios for daily (0.9739693), weekly (0.989461), 10days (0.9894429) and monthly (0.9998085). In M2, overall model performances show that normalization using LogNormal is preferably giving a good result of each categories except for 10-days with BDTR and DFR are the most acceptable result than NNR and BLR. It is concluded that, two different methods have been applied with different scenarios and different time horizons, and Ml shows a rather high accuracy than M2 using BDTR modeling. (C) 2020 The Authors. Published by Elsevier B.V. on behalf of Faculty of Engineering, Ain Shams University.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据