4.0 Article

Automated Analysis of the US Drought Monitor Maps With Machine Learning and Multiple Drought Indicators

Journal

FRONTIERS IN BIG DATA
Volume 4, Issue -, Pages -

Publisher

FRONTIERS MEDIA SA
DOI: 10.3389/fdata.2021.750536

Keywords

USDM; machine learning; drought monitoring; logistic regression; random forest; SVM-support vector machines; drought indices

Funding

  1. National Science Foundation [NSF-1615612, NSF-2006633]
  2. NOAA [NA17OAR4310132]

Ask authors/readers for more resources

The US Drought Monitor is a vital tool for real-time drought monitoring, but its production involves human judgment, making it difficult for others to reproduce the maps. This study developed a framework using machine learning to automatically generate similar maps, with the support vector machines algorithm and specific data group achieving near-perfect reproduction accuracy.
The US Drought Monitor (USDM) is a hallmark in real time drought monitoring and assessment as it was developed by multiple agencies to provide an accurate and timely assessment of drought conditions in the US on a weekly basis. The map is built based on multiple physical indicators as well as reported observations from local contributors before human analysts combine the information and produce the drought map using their best judgement. Since human subjectivity is included in the production of the USDM maps, it is not an entirely clear quantitative procedure for other entities to reproduce the maps. In this study, we developed a framework to automatically generate the maps through a machine learning approach by predicting the drought categories across the domain of study. A persistence model served as the baseline model for comparison in the framework. Three machine learning algorithms, logistic regression, random forests, and support vector machines, with four different groups of input data, which formed an overall of 12 different configurations, were used for the prediction of drought categories. Finally, all the configurations were evaluated against the baseline model to select the best performing option. The results showed that our proposed framework could reproduce the drought maps to a near-perfect level with the support vector machines algorithm and the group 4 data. The rest of the findings of this study can be highlighted as: 1) employing the past week drought data as a predictor in the models played an important role in achieving high prediction scores, 2) the nonlinear models, random forest, and support vector machines had a better overall performance compared to the logistic regression models, and 3) with borrowing the neighboring grid cells information, we could compensate the lack of training data in the grid cells with insufficient historical USDM data particularly for extreme and exceptional drought conditions.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.0
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available