☆ 4.7 Article

Cardiac surgery risk prediction using ensemble machine learning to incorporate legacy risk scores: A benchmarking study

DIGITAL HEALTH (2023)

期刊

DIGITAL HEALTH

卷 9, 期 -, 页码 -

出版社

SAGE PUBLICATIONS LTD

DOI: 10.1177/20552076231187605

关键词

Cardiac surgery; risk prediction; machine learning; mortality; ensemble learning; dynamic model averaging; legacy scores; multi-modal data

类别

Health Care Sciences & Services Health Policy & Services Public, Environmental & Occupational Health Medical Informatics

向作者/读者索取更多资源

Protocol

社区支持

Reagent

社区支持

智能总结 New
摘要

This study combines EuroSCORE I and EuroSCORE II data using ensemble learning methods to predict cardiac surgery risk. The results show that both homogeneous and heterogeneous machine learning ensembles perform better than dynamic model averaging ensembles.

ObjectiveThe introduction of new clinical risk scores (e.g. European System for Cardiac Operative Risk Evaluation (EuroSCORE) II) superseding original scores (e.g. EuroSCORE I) with different variable sets typically result in disparate datasets due to high levels of missingness for new score variables prior to time of adoption. Little is known about the use of ensemble learning to incorporate disparate data from legacy scores. We tested the hypothesised that Homogenenous and Heterogeneous Machine Learning (ML) ensembles will have better performance than ensembles of Dynamic Model Averaging (DMA) for combining knowledge from EuroSCORE I legacy data with EuroSCORE II data to predict cardiac surgery risk. MethodsUsing the National Adult Cardiac Surgery Audit dataset, we trained 12 different base learner models, based on two different variable sets from either EuroSCORE I (LogES) or EuroScore II (ES II), partitioned by the time of score adoption (1996-2016 or 2012-2016) and evaluated on holdout set (2017-2019). These base learner models were ensembled using nine different combinations of six ML algorithms to produce homogeneous or heterogeneous ensembles. Performance was assessed using a consensus metric. ResultsXgboost homogenous ensemble (HE) was the highest performing model (clinical effectiveness metric (CEM) 0.725) with area under the curve (AUC) (0.8327; 95% confidence interval (CI) 0.8323-0.8329) followed by Random Forest HE (CEM 0.723; AUC 0.8325; 95%CI 0.8320-0.8326). Across different heterogenous ensembles, significantly better performance was obtained by combining siloed datasets across time (CEM 0.720) than building ensembles of either 1996-2011 (t-test adjusted, p = 1.67x10(-6)) or 2012-2019 (t-test adjusted, p = 1.35x10(-193)) datasets alone. ConclusionsBoth homogenous and heterogenous ML ensembles performed significantly better than DMA ensemble of Bayesian Update models. Time-dependent ensemble combination of variables, having differing qualities according to time of score adoption, enabled previously siloed data to be combined, leading to increased power, clinical interpretability of variables and usage of data.

Cardiac surgery risk prediction using ensemble machine learning to incorporate legacy risk scores: A benchmarking study

期刊

DIGITAL HEALTH

出版社

SAGE PUBLICATIONS LTD

关键词

类别

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

Cardiac surgery risk prediction using ensemble machine learning to incorporate legacy risk scores: A benchmarking study

期刊

DIGITAL HEALTH

出版社

SAGE PUBLICATIONS LTD

关键词

类别

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

导出引文

分享论文