4.6 Article

On mining latent topics from healthcare chat logs

Journal

JOURNAL OF BIOMEDICAL INFORMATICS
Volume 61, Issue -, Pages 247-259

Publisher

ACADEMIC PRESS INC ELSEVIER SCIENCE
DOI: 10.1016/j.jbi.2016.04.008

Keywords

Social media analysis; Latent Dirichlet allocation; Healthcare chat group; Topic discovery

Funding

  1. National Nature Science Foundation of China [81101126]

Ask authors/readers for more resources

Background: Public and internet-based social media such as online healthcare-oriented chat groups provide a convenient channel for patients and people concerned about health to communicate and share information with each other. The chat logs of an online healthcare-oriented chat group can potentially be used to extract latent topics, to encourage participation, and to recommend relevant healthcare information to users. Objective: This paper addresses the use of online healthcare chat logs to automatically discover both underlying topics and user interests. Method: We present a new probabilistic model that exploits healthcare chat logs to find hidden topics and changes in these topics over time. The proposed model uses separate but associated hidden variables to explore both topics and individual interests such that it can provide useful insights to the participants of online healthcare chat groups about their interests in terms of weighted topics or vice versa. Results: We evaluate the proposed model on a real-world chat log by comparing its performance to benchmark topic models, i.e., latent Dirichlet allocation (LDA) and Author Topic Model (ATM), on the topic extraction task The chat log is obtained from an online chat group of pregnant women, which consists of 233,452 chat word tokens contributed by 118 users. Both detected individual interests and underlying topics with their progressive information over time are demonstrated. The results show that the performance of the proposed model exceeds that of the benchmark models. Conclusion: The experimental results illustrate that the proposed model is a promising method for extracting healthcare knowledge from social media data. (C) 2016 Elsevier Inc. All rights reserved.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.6
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available