3.8 Article

Interpretable machine learning prediction of all-cause mortality

Journal

COMMUNICATIONS MEDICINE
Volume 2, Issue 1, Pages -

Publisher

SPRINGERNATURE
DOI: 10.1038/s43856-022-00180-x

Keywords

-

Funding

  1. National Science Foundation [DBI-1759487, DBI-1552309, DBI-1355899, DGE-1762114]
  2. National Institutes of Health [R35 GM 128638, R01 NIA AG 061132, P30 AG 013280]

Ask authors/readers for more resources

This paper analyzes all-cause mortality using complex machine learning models and proposes a framework called IMPACT to explain mortality prediction. The study shows that the IMPACT models achieve higher accuracy than linear models and neural networks. By applying the IMPACT framework to the NHANES dataset, the authors identify overlooked risk factors and develop interpretable mortality risk scores. The study aims to bring explainable artificial intelligence to epidemiology.
Background Unlike linear models which are traditionally used to study all-cause mortality, complex machine learning models can capture non-linear interrelations and provide opportunities to identify unexplored risk factors. Explainable artificial intelligence can improve prediction accuracy over linear models and reveal great insights into outcomes like mortality. This paper comprehensively analyzes all-cause mortality by explaining complex machine learning models. Methods We propose the IMPACT framework that uses XAI technique to explain a state-of-the-art tree ensemble mortality prediction model. We apply IMPACT to understand all-cause mortality for 1-, 3-, 5-, and 10-year follow-up times within the NHANES dataset, which contains 47,261 samples and 151 features. Results We show that IMPACT models achieve higher accuracy than linear models and neural networks. Using IMPACT, we identify several overlooked risk factors and interaction effects. Furthermore, we identify relationships between laboratory features and mortality that may suggest adjusting established reference intervals. Finally, we develop highly accurate, efficient and interpretable mortality risk scores that can be used by medical professionals and individuals without medical expertise. We ensure generalizability by performing temporal validation of the mortality risk scores and external validation of important findings with the UK Biobank dataset. Conclusions IMPACT's unique strength is the explainable prediction, which provides insights into the complex, non-linear relationships between mortality and features, while maintaining high accuracy. Our explainable risk scores could help individuals improve self-awareness of their health status and help clinicians identify patients with high risk. IMPACT takes a consequential step towards bringing contemporary developments in XAI to epidemiology.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

3.8
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available