4.6 Article

Parkinson's Disease Detection Using Filter Feature Selection and a Genetic Algorithm with Ensemble Learning

Journal

DIAGNOSTICS
Volume 13, Issue 17, Pages -

Publisher

MDPI
DOI: 10.3390/diagnostics13172816

Keywords

Parkinson's disease (PD); filter feature selection; ensemble learning; genetic selection

Ask authors/readers for more resources

This study investigates the impact of filter feature selection, followed by ensemble learning methods and genetic selection, on the detection of Parkinson's disease patients based on attributes extracted from voice clips. Two distinct datasets were used, and various classification models were tested on the filtered data. Decision tree, random forest, and XGBoost classifiers achieved remarkable results, especially on Dataset 1 with 100% accuracy by decision tree and random forest. Ensemble learning methods and genetic selection further improved the precision of the predictions.
Parkinson's disease (PD) is a neurodegenerative disorder marked by motor and non-motor symptoms that have a severe impact on the quality of life of the affected individuals. This study explores the effect of filter feature selection, followed by ensemble learning methods and genetic selection, on the detection of PD patients from attributes extracted from voice clips from both PD patients and healthy patients. Two distinct datasets were employed in this study. Filter feature selection was carried out by eliminating quasi-constant features. Several classification models were then tested on the filtered data. Decision tree, random forest, and XGBoost classifiers produced remarkable results, especially on Dataset 1, where 100% accuracy was achieved by decision tree and random forest. Ensemble learning methods (voting, stacking, and bagging) were then applied to the best-performing models to see whether the results could be enhanced further. Additionally, genetic selection was applied to the filtered data and evaluated using several classification models for their accuracy and precision. It was found that in most cases, the predictions for PD patients showed more precision than those for healthy individuals. The overall performance was also better on Dataset 1 than on Dataset 2, which had a greater number of features.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.6
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available