4.5 Article

Does deep learning help topic extraction? A kernel k-means clustering method with word embedding

Journal

JOURNAL OF INFORMETRICS
Volume 12, Issue 4, Pages 1099-1117

Publisher

ELSEVIER
DOI: 10.1016/j.joi.2018.09.004

Keywords

Bibliometrics; Topic analysis; Cluster analysis; Text mining

Funding

  1. Australian Research Council [DP150101645]
  2. United States National Science Foundation [1759960]

Ask authors/readers for more resources

Topic extraction presents challenges for the bibliometric community, and its performance still depends on human intervention and its practical areas. This paper proposes a novel kernel k-means clustering method incorporated with a word embedding model to create a solution that effectively extracts topics from bibliometric data. The experimental results of a comparison of this method with four clustering baselines (i.e., k-means, fuzzy c-means, principal component analysis, and topic models) on two bibliometric datasets demonstrate its effectiveness across either a relatively broad range of disciplines or a given domain. An empirical study on bibliometric topic extraction from articles published by three top tier bibliometric journals between 2000 and 2017, supported by expert knowledge-based evaluations, provides supplemental evidence of the method's ability on topic extraction. Additionally, this empirical analysis reveals insights into both overlapping and diverse research interests among the three journals that would benefit journal publishers, editorial boards, and research communities. (C) 2018 Elsevier Ltd. All rights reserved.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.5
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available