4.7 Article

A novel stacking technique for prediction of diabetes

期刊

COMPUTERS IN BIOLOGY AND MEDICINE
卷 135, 期 -, 页码 -

出版社

PERGAMON-ELSEVIER SCIENCE LTD
DOI: 10.1016/j.compbiomed.2021.104554

关键词

Stacking; Multilayer perceptron; Support vector machines; Logistic regression; AdaBoost classifier

资金

  1. Koneru Lakshmaiah Education Foundation

向作者/读者索取更多资源

This study introduces a novel stacking technique that outperforms other models in terms of model performance when compared with AdaBoost in the context of the PIMA Indian diabetes dataset. The proposed technique integrated intelligent models and led to an improvement in model performance, demonstrating its potential for broader applications in other datasets such as Cleveland heart disease and Wisconsin breast cancer diagnostic datasets.
Background: Machine Learning (ML) represents a rapidly growing technology that supplies the most effective solutions for solving complex problems. The application of ML techniques in healthcare is gaining more attention because of ML-associated automatic pattern identification mechanisms. Diabetes is characterized by hyperglycemia resulting from improper insulin secretion and/or insulin utilization. Methods: The PIMA Indian diabetes dataset was obtained from the University of California/Irvine (UCI) machine learning repository for experimental purposes. The study was carried out in three stages: (1) a correlation technique was developed for feature selection; (2) the AdaBoost technique was implemented on selected features for classification; and (3) a novel stacking technique with multi-layer perceptron, support vector machine, and logistic regression (MLP, SVM, and LR, respectively) was designed and developed for the selected features. Results: The proposed stacking technique integrated the intelligent models and led to an improvement in model performance, thereby overcoming the issue of generating multiple decision stumps by AdaBoost. The proposed novel stacking technique outperformed other models when compared with AdaBoost in terms of performance metrics. The proposed models were then implemented on other datasets, such as the Cleveland heart disease and Wisconsin breast cancer diagnostic datasets, to illustrate their broader applications. Conclusion: Stacking can outperform other models when compared with the other reported techniques that were implemented using the PIMA Indian diabetes dataset.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据