☆ 4.6 Article

Unsupervised Learning for Automated Detection of Coronary Artery Disease Subgroups

JOURNAL OF THE AMERICAN HEART ASSOCIATION (2021)

期刊

JOURNAL OF THE AMERICAN HEART ASSOCIATION

卷 10, 期 23, 页码 -

出版社

WILEY

DOI: 10.1161/JAHA.121.021976

关键词

cluster analysis; coronary artery disease; machine learning; phenotype discovery

类别

Cardiac & Cardiovascular Systems

资金

American Heart Association, Dallas, TX [EIA34770065]
National Institutes of Health/National Heart, Lung, and Blood Institute, Bethesda, MD [R35HL144475, 1K01HL148639-01, R01HL075774]
National Institutes of Health/National Library of Medicine, Bethesda, MD [R01LM011369-06]
Deutsche Herzstiftung
2018-2019 Society of University Surgeons Junior Faculty Award, Los Angeles, CA
National Institutes of Health/National Heart, Lung, and Blood Institute [K12HL087746]

向作者/读者索取更多资源

Protocol

社区支持

Reagent

社区支持

智能总结 New
摘要

This study used unsupervised machine learning to analyze data of patients with coronary artery disease and identified four subgroups with different clinical trajectories. In terms of risk assessment, cluster membership was more informative than the pooled cohort equations.

BACKGROUND: The promise of precision population health includes the ability to use robust patient data to tailor prevention and care to specific groups. Advanced analytics may allow for automated detection of clinically informative subgroups that account for clinical, genetic, and environmental variability. This study sought to evaluate whether unsupervised machine learning approaches could interpret heterogeneous and missing clinical data to discover clinically important coronary artery disease subgroups. METHODS AND RESULTS: The Genetic Determinants of Peripheral Arterial Disease study is a prospective cohort that includes individuals with newly diagnosed and/or symptomatic coronary artery disease. We applied generalized low rank modeling and K--means cluster analysis using 155 phenotypic and genetic variables from 1329 participants. Cox proportional hazard models were used to examine associations between clusters and major adverse cardiovascular and cerebrovascular events and all-cause mortality. We then compared performance of risk stratification based on clusters and the American College of Cardiology/American Heart Association pooled cohort equations. Unsupervised analysis identified 4 phenotypically and prognostically distinct clusters. All-cause mortality was highest in cluster 1 (oldest/most comorbid; 26%), whereas major adverse cardiovascular and cerebrovascular event rates were highest in cluster 2 (youngest/multiethnic; 41%). Cluster 4 (middle-aged/ healthiest behaviors) experienced more incident major adverse cardiovascular and cerebrovascular events (30%) than cluster 3 (middle-aged/lowest medication adherence; 23%), despite apparently similar risk factor and lifestyle profiles. In comparison with the pooled cohort equations, cluster membership was more informative for risk assessment of myocardial infarction, stroke, and mortality. CONCLUSIONS: Unsupervised clustering identified 4 unique coronary artery disease subgroups with distinct clinical trajectories. Flexible unsupervised machine learning algorithms offer the ability to meaningfully process heterogeneous patient data and provide sharper insights into disease characterization and risk assessment.

Unsupervised Learning for Automated Detection of Coronary Artery Disease Subgroups

期刊

JOURNAL OF THE AMERICAN HEART ASSOCIATION

出版社

WILEY

关键词

类别

资金

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

Unsupervised Learning for Automated Detection of Coronary Artery Disease Subgroups

期刊

JOURNAL OF THE AMERICAN HEART ASSOCIATION

出版社

WILEY

关键词

类别

资金

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

导出引文

分享论文