4.7 Article

Data-driven identification of ageing-related diseases from electronic health records

期刊

SCIENTIFIC REPORTS
卷 11, 期 1, 页码 -

出版社

NATURE PORTFOLIO
DOI: 10.1038/s41598-021-82459-y

关键词

-

资金

  1. Wellcome Trust [WT 206274/Z/17/Z, MR/K006584/1, WT 110284/Z/15/Z]
  2. Rosetrees and Stoneygate Trust
  3. Wellcome Trust Strategic Award
  4. Max Planck Society
  5. UK Medical Research Council [MR/N013867/1]
  6. National Institute for Health Research University College London Hospitals Biomedical Research Centre
  7. British Heart Foundation Accelerator Award [AA/18/6/24223]
  8. Medical Research Council
  9. Arthritis Research UK
  10. British Heart Foundation
  11. Cancer Research UK
  12. Chief Scientist Office
  13. Economic and Social Research Council
  14. Engineering and Physical Sciences Research Council
  15. National Institute for Health Research
  16. National Institute for Social Care and Health Research
  17. UK Medical Research Council
  18. Department of Health and Social Care (England)
  19. Chief Scientist Office of the Scottish Government Health and Social Care Directorates
  20. Health and Social Care Research and Development Division (Welsh Government)
  21. Public Health Agency (Northern Ireland)
  22. Wellcome Trust
  23. UKRI Innovation/Rutherford Fellowship
  24. Sir Henry Wellcome Postdoctoral Fellowship from the Wellcome Trust [WT 201375/Z/16/Z]
  25. Alan Turing Fellowship
  26. MRC [MR/S003754/1, G0902393, MR/M501633/2, 1764958] Funding Source: UKRI

向作者/读者索取更多资源

Reducing the burden of late-life morbidity requires understanding the mechanisms of ageing-related diseases. The study proposed a framework using machine learning and actuarial techniques to identify and cluster ageing-related diseases, resulting in 207 diseases being categorized into four clusters based on different age ranges.
Reducing the burden of late-life morbidity requires an understanding of the mechanisms of ageing-related diseases (ARDs), defined as diseases that accumulate with increasing age. This has been hampered by the lack of formal criteria to identify ARDs. Here, we present a framework to identify ARDs using two complementary methods consisting of unsupervised machine learning and actuarial techniques, which we applied to electronic health records (EHRs) from 3,009,048 individuals in England using primary care data from the Clinical Practice Research Datalink (CPRD) linked to the Hospital Episode Statistics admitted patient care dataset between 1 April 2010 and 31 March 2015 (mean age 49.7 years (s.d. 18.6), 51% female, 70% white ethnicity). We grouped 278 high-burden diseases into nine main clusters according to their patterns of disease onset, using a hierarchical agglomerative clustering algorithm. Four of these clusters, encompassing 207 diseases spanning diverse organ systems and clinical specialties, had rates of disease onset that clearly increased with chronological age. However, the ages of onset for these four clusters were strikingly different, with median age of onset 82 years (IQR 82-83) for Cluster 1, 77 years (IQR 75-77) for Cluster 2, 69 years (IQR 66-71) for Cluster 3 and 57 years (IQR 54-59) for Cluster 4. Fitting to ageing-related actuarial models confirmed that the vast majority of these 207 diseases had a high probability of being ageing-related. Cardiovascular diseases and cancers were highly represented, while benign neoplastic, skin and psychiatric conditions were largely absent from the four ageing-related clusters. Our framework identifies and clusters ARDs and can form the basis for fundamental and translational research into ageing pathways.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据