Article

Dynamic Online HDP model for discovering evolutionary topics from Chinese social texts

Journal

NEUROCOMPUTING
Volume 171, Pages 412-424

Publisher

ELSEVIER SCIENCE BV
DOI: 10.1016/j.neucom.2015.06.047

Keywords

Hierarchical Dirichlet Process; Topic probability model; Dynamic topic discovery; Chinese social media

Funding

  1. National Natural Science Foundation of China [61472258, 61402294, 61202159]
  2. Guangdong Natural Science Foundation [S2013040012895]
  3. Foundation for Distinguished Young Talents in Higher Education of Guangdong, China [2013LYM_0076]
  4. Science and Technology Foundation of Shenzhen City [JCYJ20140509172609162, JCYJ20130329102032059]


User-generated content in social media, such as online reviews, evolves rapidly over time. To better understand this content, users want not only to examine what the topics are, but also to discover how the topics evolve. In this paper, we propose a Dynamic Online Hierarchical Dirichlet Process model (DOHDP) to discover evolutionary topics in Chinese social texts. In our DOHDP model, the evolution of topics is considered at two levels, i.e. the inter-epoch level and the intra-epoch level. At the inter-epoch level, the corpus of each epoch is modeled with an online HDP topic model, and the social texts are generated in a sequence mode. At the intra-epoch level, the time dependencies of historical epochs are modeled with an exponential decay function in which more recent epochs have a relatively stronger influence on the model parameters than earlier epochs. Furthermore, we implement our DOHDP model using a two-phase online variational algorithm. Comparing our DOHDP model with other related topic models on the Chinese social media dataset Tianya-80299, the experimental results show that the DOHDP model provides the best performance for discovering the evolutionary topics of Chinese social texts. (C) 2015 Elsevier B.V. All rights reserved.
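
The exponential decay over historical epochs mentioned in the abstract can be illustrated with a minimal sketch. The abstract does not give the paper's formula, so everything here is an assumption: the function names `decay_weights` and `decayed_prior`, the decay rate `kappa`, the weight form exp(-kappa * age), and the idea of blending per-epoch topic-word parameters into a single prior are all hypothetical and may differ from what DOHDP actually does.

```python
import numpy as np

def decay_weights(num_past_epochs, kappa=0.5):
    """Normalized weights exp(-kappa * age) for past epochs, oldest first.

    kappa is a hypothetical decay rate, not a value from the paper.
    """
    # The oldest epoch has the largest age; the most recent has age 1.
    ages = np.arange(num_past_epochs, 0, -1)
    raw = np.exp(-kappa * ages)
    return raw / raw.sum()

def decayed_prior(past_topic_params, kappa=0.5):
    """Blend per-epoch topic-word parameters (oldest first) into one prior.

    past_topic_params has shape (epochs, topics, vocabulary).
    This weighted average is an illustrative assumption only.
    """
    past = np.asarray(past_topic_params, dtype=float)
    weights = decay_weights(past.shape[0], kappa)
    # Contract the epoch axis: result has shape (topics, vocabulary).
    return np.tensordot(weights, past, axes=1)
```

With weights of this form, the most recent epoch always contributes the most to the blended prior, and a larger `kappa` makes older epochs fade faster.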

