4.6 Article

Modeling healthcare data using multiple-channel latent Dirichlet allocation

Journal

JOURNAL OF BIOMEDICAL INFORMATICS
Volume 60, Issue -, Pages 210-223

Publisher

ACADEMIC PRESS INC ELSEVIER SCIENCE
DOI: 10.1016/j.jbi.2016.02.003

Keywords

Healthcare data mining; Health informatics; Multiple-channel latent Dirichlet allocation; Diagnosis-medication associations; Medication prediction; Diagnosis prediction

Funding

  1. National Science Council of Taiwan [NSC101-3114-Y-002-003]
  2. Ministry of Science and Technology, Taiwan [MOST 103-2410-H-002-108-MY3, MOST 103-2410-H-002-110-MY3, MOST 104-2410-H-002-225-MY3]

Ask authors/readers for more resources

Information and communications technologies have enabled healthcare institutions to accumulate large amounts of healthcare data that include diagnoses, medications, and additional contextual information such as patient demographics. To gain a better understanding of big healthcare data and to develop better data-driven clinical decision support systems, we propose a novel multiple -channel latent Dirichlet allocation (MCLDA) approach for modeling diagnoses, medications, and contextual information in healthcare data. The proposed MCLDA model assumes that a latent health status group structure is responsible for the observed co-occurrences among diagnoses, medications, and contextual information. Using a real-world research testbed that includes one million healthcare insurance claim records, we investigate the utility of MCLDA. Our empirical evaluation results suggest that MCLDA is capable of capturing the comorbidity structures and linking them with the distribution of medications. Moreover, MCLDA is able to identify the pairing between diagnoses and medications in a record based on the assigned latent groups. MCLDA can also be employed to predict missing medications or diagnoses given partial records. Our evaluation results also show that, in most cases, MCLDA outperforms alternative methods such as logistic regressions and the k-nearest-neighbor (KNN) model for two prediction tasks, i.e., medication and diagnosis prediction. Thus, MCLDA represents a promising approach to modeling healthcare data for clinical decision support. (C) 2016 Elsevier Inc. All rights reserved.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.6
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available