4.7 Article

Active link selection for efficient semi-supervised community detection

期刊

SCIENTIFIC REPORTS
卷 5, 期 -, 页码 -

出版社

NATURE PORTFOLIO
DOI: 10.1038/srep09039

关键词

-

资金

  1. National Natural Science Foundation of China [61422213, 61303110]
  2. National Basic Research Program of China [2013CB329305]
  3. Chinese Academy of Sciences [XDA06030601]
  4. PhD Programs Foundation of Ministry of Education of China [20130032120043]
  5. Foundation for the Young Scholars by Tianjin University of Commerce [150113]
  6. National Training Programs of Innovation and Entrepreneurship for Undergraduates [201410069040]

向作者/读者索取更多资源

Several semi-supervised community detection algorithms have been proposed recently to improve the performance of traditional topology-based methods. However, most of them focus on how to integrate supervised information with topology information; few of them pay attention to which information is critical for performance improvement. This leads to large amounts of demand for supervised information, which is expensive or difficult to obtain in most fields. For this problem we propose an active link selection framework, that is we actively select the most uncertain and informative links for human labeling for the efficient utilization of the supervised information. We also disconnect the most likely inter-community edges to further improve the efficiency. Our main idea is that, by connecting uncertain nodes to their community hubs and disconnecting the inter-community edges, one can sharpen the block structure of adjacency matrix more efficiently than randomly labeling links as the existing methods did. Experiments on both synthetic and real networks demonstrate that our new approach significantly outperforms the existing methods in terms of the efficiency of using supervised information. It needs similar to 13% of the supervised information to achieve a performance similar to that of the original semi-supervised approaches.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据