4.6 Article

Learning the progression patterns of treatments using a probabilistic generative model

期刊

JOURNAL OF BIOMEDICAL INFORMATICS
卷 137, 期 -, 页码 -

出版社

ACADEMIC PRESS INC ELSEVIER SCIENCE
DOI: 10.1016/j.jbi.2022.104271

关键词

Disease progression modeling; Electronic health records; Markov model; Probabilistic generative model; Unsupervised machine learning

向作者/读者索取更多资源

This paper presents a probabilistic generative model for disease modeling and patient treatment based on Electronic Health Records. The model aims to identify different subtypes of treatments for a given disease and discover their development and progression. It considers the hierarchical structure of latent variables to classify and segment the treatment sequences. The model's learning procedure is efficiently solved with the Expectation-Maximization algorithm based on dynamic programming. The evaluation includes recovering the generative model underlying synthetic data and assessing the model's ability to provide treatment classification and staging information in real-world data. The model can be used for classification, simulation, data augmentation, and missing data imputation.
Modeling a disease or the treatment of a patient has drawn much attention in recent years due to the vast amount of information that Electronic Health Records contain. This paper presents a probabilistic generative model of treatments that are described in terms of sequences of medical activities of variable length. The main objective is to identify distinct subtypes of treatments for a given disease, and discover their development and progression. To this end, the model considers that a sequence of actions has an associated hierarchical structure of latent variables that both classifies the sequences based on their evolution over time, and segments the sequences into different progression stages. The learning procedure of the model is performed with the Expectation-Maximization algorithm which considers the exponential number of configurations of the latent variables and is efficiently solved with a method based on dynamic programming. The evaluation of the model is twofold: first, we use synthetic data to demonstrate that the learning procedure allows the generative model underlying the data to be recovered; we then further assess the potential of our model to provide treatment classification and staging information in real-world data. Our model can be seen as a tool for classification, simulation, data augmentation and missing data imputation.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.6
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据