☆ 4.5 Article

Feature extraction from unequal length heterogeneous EHR time series via dynamic time warping and tensor decomposition

DATA MINING AND KNOWLEDGE DISCOVERY (2021)

期刊

DATA MINING AND KNOWLEDGE DISCOVERY

卷 35, 期 4, 页码 1760-1784

出版社

SPRINGER

DOI: 10.1007/s10618-020-00724-6

关键词

Electronic health records; Dynamic time warping; Tensor decomposition; Patient similarity

类别

Computer Science, Artificial Intelligence Computer Science, Information Systems

向作者/读者索取更多资源

Protocol

社区支持

Reagent

社区支持

智能总结 New
摘要

This study proposes a method for handling irregularly sampled and unequal length electronic health record time series using dynamic time warping and tensor decomposition to learn the latent structure of patient data for patient representation and in-hospital mortality prediction. The research demonstrates outstanding classification performance on two patient cohorts from the MIMIC-III database and provides a detailed analysis of feature importance.

Electronic Health Records (EHR) data is routinely generated patient data that can provide useful information for analytical tasks such as disease detection and clinical event prediction. However, temporal EHR data such as physiological vital signs and lab test results are particularly challenging. Temporal EHR features typically have different sampling frequencies; such examples include heart rate (measured almost continuously) and blood test results (a few times during a patient's entire stay). Different patients also have different length of stays. Existing approaches for temporal EHR sequence extraction either ignore the temporal pattern within features, or use a predefined window to select a section of the sequences without taking into account all the information. We propose a novel approach to tackle the issue of irregularly sampled, unequal length EHR time series using dynamic time warping and tensor decomposition. We use DTW to learn the pairwise distances for each temporal feature among the patient cohort and stack the distance matrices into a tensor. We then decompose the tensor to learn the latent structure, which is consequently used for patient representation. Finally, we use the patient representation for in-hospital mortality prediction. We illustrate our method on two cohorts from the MIMIC-III database: the sepsis and the acute kidney failure cohorts. We show that our method produces outstanding classification performance in terms of AUROC, AUPRC and accuracy compared with the baseline methods: LSTM and DTW-KNN. In the end we provide a detailed analysis on the feature importance for the interpretability of our method.

Feature extraction from unequal length heterogeneous EHR time series via dynamic time warping and tensor decomposition

期刊

DATA MINING AND KNOWLEDGE DISCOVERY

出版社

SPRINGER

关键词

类别

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

Feature extraction from unequal length heterogeneous EHR time series via dynamic time warping and tensor decomposition

期刊

DATA MINING AND KNOWLEDGE DISCOVERY

出版社

SPRINGER

关键词

类别

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

导出引文

分享论文