3.9 Article

An ensemble data mining approach to discover medical patterns and provide a system to predict the mortality in the ICU of cardiac surgery based on stacking machine learning method

Publisher

TAYLOR & FRANCIS LTD
DOI: 10.1080/21681163.2022.2063189

Keywords

Classification; stacking ensemble method; heart surgery; unbalanced data problem; hybrid predictive model; machine learning in healthcare; resampling method; edited-nearest-neighbor; nonparametric test

Ask authors/readers for more resources

This research utilizes machine learning and data mining techniques to build a stacking predictive model for predicting the mortality after heart surgery. By using feature importance and a combination of sampling algorithms, the introduced model achieves higher accuracy and efficiency compared to other models.
The most effective approach to reduce disease mortality is to diagnose it as soon as possible. As a result, data mining by applying machine learning in the field of diseases provides good opportunities to examine the hidden patterns of this collection. An exact forecast of the mortality after heart surgery will cause successful medical treatment and fewer costs. This research wants to recommend a new stacking predictive model after utilising the random forest feature importance method to foresee the mortality after heart surgery on a highly unbalanced dataset by using the most practical features. To solve the unbalanced data problem, a combination of the SVM-SMOTE over-sampling algorithm and the Edited-Nearest-Neighbour under-sampling algorithm is used. This research compares the introduced model with some other machine learning classifiers to ensure efficiency through shuffle hold-out and 10-fold cross-validation strategies. In order to validate the performance of the implemented machine learning methods in this research, both shuffle hold-out, and 10-fold cross-validation results indicated that our model had the highest efficiency compared to the other models. Furthermore, the Friedman statistical test is applied to survey the differences between models. The result demonstrates that the introduced stacking model reached the most accurate predicting performance.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

3.9
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available