On mining latent topics from healthcare chat logs

JOURNAL OF BIOMEDICAL INFORMATICS (2016)

Journal

JOURNAL OF BIOMEDICAL INFORMATICS

Volume 61, Pages 247-259

Publisher

ACADEMIC PRESS INC ELSEVIER SCIENCE

DOI: 10.1016/j.jbi.2016.04.008

Keywords

Social media analysis; Latent Dirichlet allocation; Healthcare chat group; Topic discovery

Funding

National Natural Science Foundation of China [81101126]

Abstract

Background: Public and internet-based social media, such as online healthcare-oriented chat groups, provide a convenient channel for patients and other people concerned about health to communicate and share information with each other. The chat logs of an online healthcare-oriented chat group can potentially be used to extract latent topics, to encourage participation, and to recommend relevant healthcare information to users.

Objective: This paper addresses the use of online healthcare chat logs to automatically discover both underlying topics and user interests.

Method: We present a new probabilistic model that exploits healthcare chat logs to find hidden topics and the changes in these topics over time. The proposed model uses separate but associated hidden variables to explore both topics and individual interests, so that it can give participants of online healthcare chat groups useful insight into their interests in terms of weighted topics, or vice versa.

Results: We evaluate the proposed model on a real-world chat log by comparing its performance with that of benchmark topic models, i.e., latent Dirichlet allocation (LDA) and the Author Topic Model (ATM), on the topic extraction task. The chat log is obtained from an online chat group of pregnant women and consists of 233,452 chat word tokens contributed by 118 users. Both the detected individual interests and the underlying topics, together with their progression over time, are demonstrated. The results show that the performance of the proposed model exceeds that of the benchmark models.

Conclusion: The experimental results illustrate that the proposed model is a promising method for extracting healthcare knowledge from social media data.

(C) 2016 Elsevier Inc. All rights reserved.
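
For readers unfamiliar with the LDA benchmark named in the abstract, the following is a minimal sketch of plain LDA fitted by collapsed Gibbs sampling. It is not the model proposed in the paper and does not cover ATM, user interests, or topic change over time. The toy messages, the choice to treat each message as one document, the function names `lda_gibbs` and `top_words`, and the hyperparameter values are all assumptions made for illustration.

```python
import random
from collections import defaultdict


def lda_gibbs(docs, num_topics, alpha=0.1, beta=0.01, iters=200, seed=0):
    """Collapsed Gibbs sampling for plain LDA over tokenized documents."""
    rng = random.Random(seed)
    vocab_size = len({w for doc in docs for w in doc})
    # Count tables: document-topic, topic-word, and per-topic token totals.
    doc_topic = [[0] * num_topics for _ in docs]
    topic_word = [defaultdict(int) for _ in range(num_topics)]
    topic_total = [0] * num_topics
    assignments = []
    # Start from a random topic assignment for every token.
    for d, doc in enumerate(docs):
        zs = []
        for w in doc:
            z = rng.randrange(num_topics)
            zs.append(z)
            doc_topic[d][z] += 1
            topic_word[z][w] += 1
            topic_total[z] += 1
        assignments.append(zs)
    for _ in range(iters):
        for d, doc in enumerate(docs):
            for i, w in enumerate(doc):
                z = assignments[d][i]
                # Remove the token's current assignment from the counts.
                doc_topic[d][z] -= 1
                topic_word[z][w] -= 1
                topic_total[z] -= 1
                # p(z = k | rest) is proportional to
                # (n_dk + alpha) * (n_kw + beta) / (n_k + V * beta).
                weights = [
                    (doc_topic[d][k] + alpha)
                    * (topic_word[k][w] + beta)
                    / (topic_total[k] + vocab_size * beta)
                    for k in range(num_topics)
                ]
                z = rng.choices(range(num_topics), weights=weights)[0]
                assignments[d][i] = z
                doc_topic[d][z] += 1
                topic_word[z][w] += 1
                topic_total[z] += 1
    return doc_topic, topic_word, topic_total


def top_words(topic_word, k, n=3):
    """Return the n most frequent words assigned to topic k."""
    counts = topic_word[k]
    return sorted(counts, key=lambda w: (-counts[w], w))[:n]


# Hypothetical, hand-written messages; not data from the paper.
docs = [
    "morning sickness nausea ginger tea".split(),
    "nausea ginger tea helps morning".split(),
    "ultrasound scan appointment hospital".split(),
    "hospital appointment scan next week".split(),
]
doc_topic, topic_word, topic_total = lda_gibbs(docs, num_topics=2)
for k in range(2):
    print(k, top_words(topic_word, k))
```

Each row of `doc_topic` holds the number of tokens in that message assigned to each topic, so normalizing a row gives a rough per-message topic mixture. On a corpus this small the sampled topics are unstable and vary with the seed.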
