4.6 Article

Estimation of low-density lipoprotein cholesterol levels using machine learning

Journal

INTERNATIONAL JOURNAL OF CARDIOLOGY
Volume 352, Issue -, Pages 144-149

Publisher

ELSEVIER IRELAND LTD
DOI: 10.1016/j.ijcard.2022.01.029

Keywords

Hypercholesterolemia; Low-density lipoprotein cholesterol; Triglycerides; Machine-learning; Cost-effectiveness

Ask authors/readers for more resources

This study proposed a novel method to estimate LDL-C using machine learning, which showed better correlation with directly measured LDL-C compared to conventional methods, especially at high TG levels. Machine learning algorithms were superior in estimating LDL-C compared with the conventional Friedewald or the more contemporary Martin equations, suggesting their potential use in electronic health records.
Background: Low-density lipoprotein-cholesterol (LDL-C) is used as a threshold and target for treating dyslipidemia. Although the Friedewald equation is widely used to estimate LDL-C, it has been known to be inaccurate in the case of high triglycerides (TG) or non-fasting states. We aimed to propose a novel method to estimate LDL-C using machine learning. Methods: Using a large, single-center electronic health record database, we derived a ML algorithm to estimate LDL-C from standard lipid profiles. From 1,029,572 cases with both standard lipid profiles (total cholesterol, high-density lipoprotein-cholesterol, and TG) and direct LDL-C measurements, 823,657 tests were used to derive LDL-C estimation models. Patient characteristics such as sex, age, height, weight, and other laboratory values were additionally used to create separate data sets and algorithms. Results: Machine learning with gradient boosting (LDL-C-X) and neural network (LDL-C-N) showed better correlation with directly measured LDL-C, compared with conventional methods (r = 0.9662, 0.9668, 0.9563, 0.9585; for LDL-C-X, LDL-C-N, Friedewald [LDL-CF], and Martin [LDL-C-M] equations, respectively). The overall bias of LDL-C-X(-0.27 mg/dL, 95% CI-0.30 to-0.23) and LDL-C-N(-0.01 mg/dL, 95% CI-0.04-0.03) were significantly smaller compared with both LDL-C-F (-3.80 mg/dL, 95% CI-3.80 to-3.60) or LDL-C-M (-2.00 mg/dL, 95% CI-2.00 to-1.94), especially at high TG levels. Conclusions: Machine learning algorithms were superior in estimating LDL-C compared with the conventional Friedewald or the more contemporary Martin equations. Through external validation and modification, machine learning could be incorporated into electronic health records to substitute LDL-C estimation.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.6
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available