4.2 Article

Postprocessing of Ensemble Weather Forecast Using Decision Tree-Based Probabilistic Forecasting Methods

期刊

WEATHER AND FORECASTING
卷 38, 期 1, 页码 69-82

出版社

AMER METEOROLOGICAL SOC
DOI: 10.1175/WAF-D-22-0006.1

关键词

Ensembles; Probability forecasts; models; distribution; Short-range prediction; Decision trees; Machine learning

向作者/读者索取更多资源

Producing accurate and calibrated probabilistic forecasts is of high social and economic value. Traditional linear regression models have limitations in correcting systematic errors, but tree-based machine learning methods such as quantile random forests and natural gradient boosting perform well in correcting ensemble weather forecasts. Among them, quantile random forests and natural gradient boosting have the best performance for 1-2 day forecasts, while the EMOS model based on ECMWF data outperforms other methods for 8-10 day forecasts.
Producing an accurate and calibrated probabilistic forecast has high social and economic value. Systematic errors or biases in the ensemble weather forecast can be corrected by postprocessing models whose development is an urgent challenge. Traditionally, the bias correction is done by employing linear regression models that estimate the conditional probability distribution of the forecast. Although this model framework works well, it is restricted to a prespecified model form that often relies on a limited set of predictors only. Most machine learning (ML) methods can tackle these problems with a point prediction, but only a few of them can be applied effectively in a probabilistic manner. The tree-based ML techniques, namely, natural gradient boosting (NGB), quantile random forests (QRF), and distributional regression forests (DRF), are used to adjust hourly 2-m temperature ensemble prediction at lead times of 1-10 days. The ensemble model output statistics (EMOS) and its boosting version are used as benchmark models. The model forecast is based on the European Centre for Medium-Range Weather Forecasts (ECMWF) for the Czech Republic domain. Two training periods 2015-18 and 2018 only were used to learn the models, and their prediction skill was evaluated in 2019. The results show that the QRF and NGB methods provide the best performance for 1-2-day forecasts, while the EMOS method outperforms other methods for 8-10-day forecasts. Key components to improving short-term forecasting are additional atmospheric/surface state predictors and the 4-yr training sample size. Significance StatementMachine learning methods have great potential and are beginning to be widely applied in meteorology in recent years. A new technique called natural gradient boosting (NGB) has been released and used in this paper to refine the probabilistic forecast of surface temperature. It was found that the NGB has better prediction skills than the traditional ensemble model output statistics in forecasting 1 and 2 days in advance. The NGB has similar prediction skills with lower computational demands compared to other advanced machine learning methods such as the quantile random forests. We showed a path to employ the NGB method in this task, which can be followed for refining other and more challenging meteorological variables such as wind speed or precipitation.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.2
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据