4.6 Article

Machine Learning Methods and Synthetic Data Generation to Predict Large Wildfires

期刊

SENSORS
卷 21, 期 11, 页码 -

出版社

MDPI
DOI: 10.3390/s21113694

关键词

imbalanced data; burned area; prediction large wildfire; logistic regression; multi-layer perceptron

向作者/读者索取更多资源

Wildfires are increasing in frequency globally, making prediction complex. Machine learning methods for predicting large wildfires face challenges of overfitting or underfitting due to imbalanced datasets. Using synthetic data can improve prediction accuracy, offering a new approach for Decision Support Systems in wildfire management.
Wildfires are becoming more frequent in different parts of the globe, and the ability to predict when and where they will occur is a complex process. Identifying wildfire events with high probability of becoming a large wildfire is an important task for supporting initial attack planning. Different methods, including those that are physics-based, statistical, and based on machine learning (ML) are used in wildfire analysis. Among the whole, those based on machine learning are relatively novel. In addition, because the number of wildfires is much greater than the number of large wildfires, the dataset to be used in a ML model is imbalanced, resulting in overfitting or underfitting the results. In this manuscript, we propose to generate synthetic data from variables of interest together with ML models for the prediction of large wildfires. Specifically, five synthetic data generation methods have been evaluated, and their results are analyzed with four ML methods. The results yield an improvement in the prediction power when synthetic data are used, offering a new method to be taken into account in Decision Support Systems (DSS) when managing wildfires.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.6
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据