4.6 Article

Machine Learning to Develop and Internally Validate a Predictive Model for Post-operative Delirium in a Prospective, Observational Clinical Cohort Study of Older Surgical Patients

期刊

JOURNAL OF GENERAL INTERNAL MEDICINE
卷 36, 期 2, 页码 265-273

出版社

SPRINGER
DOI: 10.1007/s11606-020-06238-7

关键词

machine learning; statistical learning; model prediction; delirium; post-operative

资金

  1. Alzheimer's Drug Discovery Foundation
  2. National Institutes of Health [P01AG031720, K07AG041835, R24AG054259, R01AG044518, R01AG030618, K24AG035075, T32AG023480, K01AG057836, R03AG061582]
  3. Alzheimer's Association [AARF-18-560786]

向作者/读者索取更多资源

The study assessed the performance of machine learning methods to predict post-operative delirium, finding that algorithms in large feature set conditions performed better than selected feature set conditions, though calibration for all models and feature sets was poor.
Background Our objective was to assess the performance of machine learning methods to predict post-operative delirium using a prospective clinical cohort. Methods We analyzed data from an observational cohort study of 560 older adults (>= 70 years) without dementia undergoing major elective non-cardiac surgery. Post-operative delirium was determined by the Confusion Assessment Method supplemented by a medical chart review (N = 134, 24%). Five machine learning algorithms and a standard stepwise logistic regression model were developed in a training sample (80% of participants) and evaluated in the remaining hold-out testing sample. We evaluated three overlapping feature sets, restricted to variables that are readily available or minimally burdensome to collect in clinical settings, including interview and medical record data. A large feature set included 71 potential predictors. A smaller set of 18 features was selected by an expert panel using a consensus process, and this smaller feature set was considered with and without a measure of pre-operative mental status. Results The area under the receiver operating characteristic curve (AUC) was higher in the large feature set conditions (range of AUC, 0.62-0.71 across algorithms) versus the selected feature set conditions (AUC range, 0.53-0.57). The restricted feature set with mental status had intermediate AUC values (range, 0.53-0.68). In the full feature set condition, algorithms such as gradient boosting, cross-validated logistic regression, and neural network (AUC = 0.71, 95% CI 0.58-0.83) were comparable with a model developed using traditional stepwise logistic regression (AUC = 0.69, 95% CI 0.57-0.82). Calibration for all models and feature sets was poor. Conclusions We developed machine learning prediction models for post-operative delirium that performed better than chance and are comparable with traditional stepwise logistic regression. Delirium proved to be a phenotype that was difficult to predict with appreciable accuracy.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.6
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据