3.8 Article

Machine learning-driven early biomarker prediction for type 2 diabetes mellitus associated coronary artery diseases

Journal

CLINICAL EPIDEMIOLOGY AND GLOBAL HEALTH
Volume 24, Issue -, Pages -

Publisher

ELSEVIER - DIVISION REED ELSEVIER INDIA PVT LTD
DOI: 10.1016/j.cegh.2023.101433

Keywords

Type 2 diabetes; Coronary artery disease; Machine learning; Risk factors; Prediction

Ask authors/readers for more resources

This study implemented a machine learning model to predict the occurrence of type 2 diabetes mellitus (T2DM), coronary artery disease (CAD), and their co-occurrence. Key variables for disease occurrence were identified. The integration of biochemical, physical, and demographic factors allowed for accurate prediction of disease prevalence using machine learning algorithms.
Background: Non-communicable diseases such as type 2 diabetes mellitus (T2DM) and coronary artery disease (CAD) are causing a significant burden on the human health care system. The present study aims to implement a machine learning (ML) based predictive model for T2DM, CAD and their co-occurrence, followed by identifying key variables for the occurrence of disease.Methods: The study is a data driven approach that applied various supervised ML models to predict the disease occurrence based on biochemical, demographic and physical data collected from 123 subjects. In addition, performance metrics like accuracy and AUC were used to evaluate the classification accuracy of ML models. Results: The data (n = 123 subjects) consist of male (83) and female (40) populace with ages from 35 to 70 years. Among all ML models, the Random Forest was outperformed with an accuracy of 76% (AUC:0.95). Similarly, in T2DM (AUC:0.92) and T2DM + CAD (AUC:0.94) classification, Random Forest achieved the highest accuracy followed by logistic regression for CAD data (AUC:0.98). The major risk factors recognized for T2DM, CAD and T2DM + CAD are HbA1c, FBS, CK-MB, APO.AII, APO.E, IP.10, and total cholesterol. Conclusion: The results suggest that the ML algorithms predict the prevalence of T2DM, CAD and concomitance in the populace through the integration of biochemical, physical and demographic factors. Based on the identified risk contributors for disease occurrence, preventive measures can be drafted to reduce the disease burden.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

3.8
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available