4.4 Article

Improved Cardiovascular Risk Prediction Using Nonparametric Regression and Electronic Health Record Data

期刊

MEDICAL CARE
卷 51, 期 3, 页码 251-258

出版社

LIPPINCOTT WILLIAMS & WILKINS
DOI: 10.1097/MLR.0b013e31827da594

关键词

cardiovascular disease; electronic health record; Framingham risk score; machine learning; nonparametric regression; risk prediction

资金

  1. VA Health Services Research & Development Service's Quality Enhancement Research Initiative (VA QUERI)
  2. Methods Core of Grant (MCDTR) from the National Institute of Diabetes and Digestive and Kidney Diseases [P30DK092926]

向作者/读者索取更多资源

Background: Use of the electronic health record (EHR) is expected to increase rapidly in the near future, yet little research exists on whether analyzing internal EHR data using flexible, adaptive statistical methods could improve clinical risk prediction. Extensive implementation of EHR in the Veterans Health Administration provides an opportunity for exploration. Objectives: To compare the performance of various approaches for predicting risk of cerebrovascular and cardiovascular (CCV) death, using traditional risk predictors versus more comprehensive EHR data. Research Design: Retrospective cohort study. We identified all Veterans Health Administration patients without recent CCV events treated at 12 facilities from 2003 to 2007, and predicted risk using the Framingham risk score, logistic regression, generalized additive modeling, and gradient tree boosting. Measures: The outcome was CCV-related death within 5 years. We assessed each method's predictive performance with the area under the receiver operating characteristic curve (AUC), the Hosmer-Lemeshow goodness-of-fit test, plots of estimated risk, and reclassification tables, using cross-validation to penalize overfitting. Results: Regression methods outperformed the Framingham risk score, even with the same predictors (AUC increased from 71% to 73% and calibration also improved). Even better performance was attained in models using additional EHR-derived predictor variables (AUC increased to 78% and net reclassification improvement was as large as 0.29). Nonparametric regression further improved calibration and discrimination compared with logistic regression. Conclusions: Despite the EHR lacking some risk factors and its imperfect data quality, health care systems may be able to substantially improve risk prediction for their patients by using internally developed EHR-derived models and flexible statistical methodology.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.4
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据