Journal
ENVIRONMENTAL AND ECOLOGICAL STATISTICS
Volume 29, Issue 4, Pages 677-704Publisher
SPRINGER
DOI: 10.1007/s10651-021-00499-z
Keywords
Case– control sampling; Conditional distribution estimation; Deep learning; Environmental statistics; Machine learning; Nonparametric statistics
Ask authors/readers for more resources
This paper introduces a conditional distribution estimation technique that combines machine learning algorithms to simultaneously estimate the entire conditional distribution and flexibly incorporate machine learning techniques, with the purpose of forecasting tropical cyclone intensity to provide additional insights and influence decision-making. Through simulation studies and real data validation, the effectiveness of the method is demonstrated, with further developments applicable to more complex forecasting and other applications.
Short-term forecasting is an important tool in understanding environmental processes. In this paper, we incorporate machine learning algorithms into a conditional distribution estimator for the purposes of forecasting tropical cyclone intensity. Many machine learning techniques give a single-point prediction of the conditional distribution of the target variable, which does not give a full accounting of the prediction variability. Conditional distribution estimation can provide extra insight on predicted response behavior, which could influence decision-making and policy. We propose a technique that simultaneously estimates the entire conditional distribution and flexibly allows for machine learning techniques to be incorporated. A smooth model is fit over both the target variable and covariates, and a logistic transformation is applied on the model output layer to produce an expression of the conditional density function. We provide two examples of machine learning models that can be used, polynomial regression and deep learning models. To achieve computational efficiency, we propose a case-control sampling approximation to the conditional distribution. A simulation study for four different data distributions highlights the effectiveness of our method compared to other machine learning-based conditional distribution estimation techniques. We then demonstrate the utility of our approach for forecasting purposes using tropical cyclone data from the Atlantic Seaboard. This paper gives a proof of concept for the promise of our method, further computational developments can fully unlock its insights in more complex forecasting and other applications.
Authors
I am an author on this paper
Click your name to claim this paper and add it to your profile.
Reviews
Recommended
No Data Available