4.4 Article

A TESTING BASED EXTRACTION ALGORITHM FOR IDENTIFYING SIGNIFICANT COMMUNITIES IN NETWORKS

期刊

ANNALS OF APPLIED STATISTICS
卷 8, 期 3, 页码 1853-1891

出版社

INST MATHEMATICAL STATISTICS-IMS
DOI: 10.1214/14-AOAS760

关键词

Community detection; networks; extraction; background; multiple testing

资金

  1. NSF [DMS-09-07177, DMS-13-10002, DMS-06-45369, DMS-11-05581, SES-1357622]
  2. James S. McDonnell Foundation 21st Century Science Initiative-Complex Systems Scholar Award [220020315]

向作者/读者索取更多资源

A common and important problem arising in the study of networks is how to divide the vertices of a given network into one or more groups, called communities, in such a way that vertices of the same community are more interconnected than vertices belonging to different ones. We propose and investigate a testing based community detection procedure called Extraction of Statistically Significant Communities (ESSC). The ESSC procedure is based on p-values for the strength of connection between a single vertex and a set of vertices under a reference distribution derived from a conditional configuration network model. The procedure automatically selects both the number of communities in the network and their size. Moreover, ESSC can handle overlapping communities and, unlike the majority of existing methods, identifies background vertices that do not belong to a well-defined community. The method has only one parameter, which controls the stringency of the hypothesis tests. We investigate the performance and potential use of ESSC and compare it with a number of existing methods, through a validation study using four real network data sets. In addition, we carry out a simulation study to assess the effectiveness of ESSC in networks with various types of community structure, including networks with overlapping communities and those with background vertices. These results suggest that ESSC is an effective exploratory tool for the discovery of relevant community structure in complex network systems. Data and software are available at http://www.unc.edu/similar to jameswd/research.html.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.4
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据