Article

A framework for stability-based module detection in correlation graphs

STATISTICAL ANALYSIS AND DATA MINING (2021)

Journal

STATISTICAL ANALYSIS AND DATA MINING

Volume 14, Issue 2, Pages 129-143

Publisher

WILEY

DOI: 10.1002/sam.11495

Keywords

clustering; graphical model; Jaccard coefficient; module detection; network; stability

Categories

Computer Science, Artificial Intelligence; Computer Science, Interdisciplinary Applications; Statistics & Probability

Funding

National Cancer Institute [P30CA016056, U24CA232979]
National Institute of Environmental Health Sciences [R01ES018846, R21ES026429]

Summary

Graphs represent relationships between variables, and detecting structure within a graph is a challenging problem. This study addresses uncertainty in module detection by using a nonparametric bootstrap to assess the stability of the detected modules. The resulting stability estimates are also used to select the complexity parameter that defines the graph, so that the graph is built in a way that optimizes stability.

Abstract

Graphs can be used to represent the direct and indirect relationships between variables, and to elucidate complex relationships and interdependencies. Detecting structure within a graph is a challenging problem. This problem is studied across a range of fields and is variously termed community detection, module detection, or graph partitioning. A popular class of algorithms for module detection identifies the structure by optimizing a function of modularity. In practice, graphs are often learned from data and are thus prone to uncertainty. In these settings, uncertainty in the network structure can be compounded, yielding unreliable estimates of the module structure. In this work, we begin to address this challenge by using a nonparametric bootstrap approach to assess the stability of module detection in a graph. Estimates of stability are presented at the level of the individual node, at the level of the inferred modules, and as an overall measure of performance for module detection in a given graph. Furthermore, bootstrap stability estimates are derived for selecting the complexity parameter that ultimately defines a graph from data, so that the graph is chosen in a way that optimizes stability. This approach is applied to correlation graphs but generalizes to other graphs that are defined through dissimilarity measures. We demonstrate our approach using a broad range of simulations and a metabolomics dataset from the Beijing Olympics Air Pollution study. These approaches are implemented in the bootcluster package, which is available for the R programming language.
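The bootstrap idea described in the abstract can be illustrated with a minimal sketch. It is written in Python with NumPy rather than R, and it does not use or reproduce the bootcluster API. Several pieces are assumptions made for illustration only: the function names, the simulated data, the use of an absolute-correlation threshold as the complexity parameter, and the use of connected components as a simple stand-in for modularity-based module detection. The per-node Jaccard score shown here is one plausible way to compare a node's module across resamples and may differ from the estimator defined in the paper.

```python
import numpy as np


def correlation_graph(X, threshold):
    """Adjacency matrix joining variables whose |correlation| exceeds threshold."""
    corr = np.corrcoef(X, rowvar=False)
    adj = np.abs(corr) > threshold
    np.fill_diagonal(adj, False)
    return adj


def connected_components(adj):
    """Label nodes by connected component (assumed stand-in for module detection)."""
    n = adj.shape[0]
    labels = np.full(n, -1)
    current = 0
    for start in range(n):
        if labels[start] != -1:
            continue
        labels[start] = current
        stack = [start]
        while stack:
            node = stack.pop()
            for nb in np.flatnonzero(adj[node]):
                if labels[nb] == -1:
                    labels[nb] = current
                    stack.append(nb)
        current += 1
    return labels


def node_jaccard(ref_labels, boot_labels):
    """Jaccard overlap between each node's reference module and bootstrap module."""
    scores = np.empty(len(ref_labels))
    for i in range(len(ref_labels)):
        a = ref_labels == ref_labels[i]
        b = boot_labels == boot_labels[i]
        scores[i] = np.sum(a & b) / np.sum(a | b)
    return scores


def bootstrap_stability(X, threshold, n_boot=100, seed=0):
    """Node-, module-, and graph-level stability from a nonparametric bootstrap."""
    rng = np.random.default_rng(seed)
    n_obs = X.shape[0]
    ref = connected_components(correlation_graph(X, threshold))
    total = np.zeros(X.shape[1])
    for _ in range(n_boot):
        rows = rng.integers(0, n_obs, size=n_obs)  # resample observations with replacement
        boot = connected_components(correlation_graph(X[rows], threshold))
        total += node_jaccard(ref, boot)
    node_stab = total / n_boot
    module_stab = {int(m): float(node_stab[ref == m].mean()) for m in np.unique(ref)}
    return ref, node_stab, module_stab, float(node_stab.mean())


# Simulated data (hypothetical): two blocks of four variables, each driven by one latent factor.
rng = np.random.default_rng(1)
n = 200
f1 = rng.normal(size=(n, 1))
f2 = rng.normal(size=(n, 1))
X = np.hstack([f1 + 0.3 * rng.normal(size=(n, 4)),
               f2 + 0.3 * rng.normal(size=(n, 4))])

ref, node_stab, module_stab, overall = bootstrap_stability(X, threshold=0.5, n_boot=50)

# Choose the complexity parameter (here, the threshold) with the highest overall stability.
thresholds = [0.3, 0.5, 0.7]
best = max(thresholds, key=lambda t: bootstrap_stability(X, t, n_boot=50)[3])
```

One caveat about this sketch: maximizing overall stability on its own can favor degenerate graphs, since a single all-inclusive module or a graph of isolated nodes is trivially stable, so a practical selection rule would need to guard against those cases.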
