4.6 Article

Application of Machine Learning to Predict COVID-19 Spread via an Optimized BPSO Model

Journal

BIOMIMETICS
Volume 8, Issue 6, Pages -

Publisher

MDPI
DOI: 10.3390/biomimetics8060457

Keywords

k-nearest neighbor; binary particle swarm optimization; random oversampling; random forest model; gradient boosting model; naive Bayes model

Ask authors/readers for more resources

This study presents an enhanced model for predicting COVID-19 samples in different regions of Saudi Arabia. The model utilizes binary particle swarm optimization for feature selection and evaluates the performance of three machine learning models. The results show that the gradient boosting model performs better in high-altitude areas, while the random forest model performs better in sea-level areas.
During the pandemic of the coronavirus disease (COVID-19), statistics showed that the number of affected cases differed from one country to another and also from one city to another. Therefore, in this paper, we provide an enhanced model for predicting COVID-19 samples in different regions of Saudi Arabia (high-altitude and sea-level areas). The model is developed using several stages and was successfully trained and tested using two datasets that were collected from Taif city (high-altitude area) and Jeddah city (sea-level area) in Saudi Arabia. Binary particle swarm optimization (BPSO) is used in this study for making feature selections using three different machine learning models, i.e., the random forest model, gradient boosting model, and naive Bayes model. A number of predicting evaluation metrics including accuracy, training score, testing score, F-measure, recall, precision, and receiver operating characteristic (ROC) curve were calculated to verify the performance of the three machine learning models on these datasets. The experimental results demonstrated that the gradient boosting model gives better results than the random forest and naive Bayes models with an accuracy of 94.6% using the Taif city dataset. For the dataset of Jeddah city, the results demonstrated that the random forest model outperforms the gradient boosting and naive Bayes models with an accuracy of 95.5%. The dataset of Jeddah city achieved better results than the dataset of Taif city in Saudi Arabia using the enhanced model for the term of accuracy.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.6
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available