☆ 4.2 Article

Postprocessing of Ensemble Weather Forecast Using Decision Tree-Based Probabilistic Forecasting Methods

WEATHER AND FORECASTING (2023)

期刊

WEATHER AND FORECASTING

卷 38, 期 1, 页码 69-82

出版社

AMER METEOROLOGICAL SOC

DOI: 10.1175/WAF-D-22-0006.1

关键词

Ensembles; Probability forecasts; models; distribution; Short-range prediction; Decision trees; Machine learning

类别

Meteorology & Atmospheric Sciences

向作者/读者索取更多资源

Protocol

社区支持

Reagent

社区支持

智能总结 New
摘要

Producing accurate and calibrated probabilistic forecasts is of high social and economic value. Traditional linear regression models have limitations in correcting systematic errors, but tree-based machine learning methods such as quantile random forests and natural gradient boosting perform well in correcting ensemble weather forecasts. Among them, quantile random forests and natural gradient boosting have the best performance for 1-2 day forecasts, while the EMOS model based on ECMWF data outperforms other methods for 8-10 day forecasts.

Producing an accurate and calibrated probabilistic forecast has high social and economic value. Systematic errors or biases in the ensemble weather forecast can be corrected by postprocessing models whose development is an urgent challenge. Traditionally, the bias correction is done by employing linear regression models that estimate the conditional probability distribution of the forecast. Although this model framework works well, it is restricted to a prespecified model form that often relies on a limited set of predictors only. Most machine learning (ML) methods can tackle these problems with a point prediction, but only a few of them can be applied effectively in a probabilistic manner. The tree-based ML techniques, namely, natural gradient boosting (NGB), quantile random forests (QRF), and distributional regression forests (DRF), are used to adjust hourly 2-m temperature ensemble prediction at lead times of 1-10 days. The ensemble model output statistics (EMOS) and its boosting version are used as benchmark models. The model forecast is based on the European Centre for Medium-Range Weather Forecasts (ECMWF) for the Czech Republic domain. Two training periods 2015-18 and 2018 only were used to learn the models, and their prediction skill was evaluated in 2019. The results show that the QRF and NGB methods provide the best performance for 1-2-day forecasts, while the EMOS method outperforms other methods for 8-10-day forecasts. Key components to improving short-term forecasting are additional atmospheric/surface state predictors and the 4-yr training sample size. Significance StatementMachine learning methods have great potential and are beginning to be widely applied in meteorology in recent years. A new technique called natural gradient boosting (NGB) has been released and used in this paper to refine the probabilistic forecast of surface temperature. It was found that the NGB has better prediction skills than the traditional ensemble model output statistics in forecasting 1 and 2 days in advance. The NGB has similar prediction skills with lower computational demands compared to other advanced machine learning methods such as the quantile random forests. We showed a path to employ the NGB method in this task, which can be followed for refining other and more challenging meteorological variables such as wind speed or precipitation.

Postprocessing of Ensemble Weather Forecast Using Decision Tree-Based Probabilistic Forecasting Methods

期刊

WEATHER AND FORECASTING

出版社

AMER METEOROLOGICAL SOC

关键词

类别

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

Postprocessing of Ensemble Weather Forecast Using Decision Tree-Based Probabilistic Forecasting Methods

期刊

WEATHER AND FORECASTING

出版社

AMER METEOROLOGICAL SOC

关键词

类别

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

导出引文

分享论文