Article

K-Module Algorithm: An Additional Step to Improve the Clustering Results of WGCNA Co-Expression Networks

Journal

GENES
Volume 12, Issue 1

Publisher

MDPI
DOI: 10.3390/genes12010087

Keywords

gene co-expression networks; distance correlation; connectivity; enrichment analysis

Funding

  1. National Natural Science Foundation of China [41876100]
  2. Development Project of Applied Technology in Harbin [2016RAXXJ071]

This paper introduces the k-module algorithm, a new method for constructing gene co-expression networks. It calculates the similarity matrix with distance correlation and assigns each gene to the module with which it has the highest mean connectivity, improving the clustering results of WGCNA. The algorithm needs few iterations, which keeps its complexity low, and it readjusts the hierarchical clustering results while retaining the advantages of the dynamic tree cut method.
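The distance correlation mentioned above can be illustrated with a short NumPy sketch. This is the standard sample distance correlation for two one-dimensional vectors, not code from the paper; the function names `_centered_distances` and `distance_correlation` are hypothetical, and how the paper turns these values into a WGCNA adjacency is not shown here.

```python
import numpy as np


def _centered_distances(v):
    """Pairwise absolute distances of a 1-D sample, double-centered."""
    v = np.asarray(v, dtype=float)
    d = np.abs(v[:, None] - v[None, :])
    # Subtract row and column means, then add back the grand mean.
    return d - d.mean(axis=0, keepdims=True) - d.mean(axis=1, keepdims=True) + d.mean()


def distance_correlation(x, y):
    """Sample distance correlation between two equally long 1-D vectors."""
    a = _centered_distances(x)
    b = _centered_distances(y)
    dcov2 = (a * b).mean()      # squared sample distance covariance
    dvar_x = (a * a).mean()     # squared sample distance variance of x
    dvar_y = (b * b).mean()     # squared sample distance variance of y
    denom = np.sqrt(dvar_x * dvar_y)
    if denom == 0:
        # At least one vector is constant; return 0 by convention.
        return 0.0
    return float(np.sqrt(max(dcov2, 0.0) / denom))
```

Unlike the Pearson correlation, this measure also responds to nonlinear dependence: for `x = [-2, -1, 0, 1, 2]` and `y = x**2` the Pearson correlation is zero, while the distance correlation is clearly positive.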
Among biological networks, co-expression networks have been widely studied. One of the most commonly used pipelines for constructing co-expression networks is weighted gene co-expression network analysis (WGCNA), which identifies clusters of highly co-expressed genes (modules) using hierarchical clustering. The major drawback of hierarchical clustering is that once two objects are merged into a cluster, the merge cannot be reversed, so an unsuitable decision cannot be re-adjusted later. In this paper, we calculate the similarity matrix for WGCNA with the distance correlation to construct a gene co-expression network, and present a new approach, the k-module algorithm, to improve the WGCNA clustering results. The method assigns each gene to the module with which it has the highest mean connectivity. It thereby re-adjusts the results of hierarchical clustering while retaining the advantages of the dynamic tree cut method. We verify the validity of the algorithm on six microarray and RNA-seq datasets. The k-module algorithm needs few iterations, which keeps its complexity low, and the gene modules it produces have high enrichment scores and strong stability. Our method improves upon hierarchical clustering and can be applied to general clustering algorithms based on a similarity matrix, not only to gene co-expression network analysis.
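The reassignment step described in the abstract can be sketched as follows. This is a minimal illustration under stated assumptions, not the authors' implementation: the abstract does not give the update order or the stopping rule, so the sketch updates all genes at once, ignores self-connections, and stops when no label changes or after a hypothetical `max_iter` cap. The name `k_module_reassign` is also an assumption.

```python
import numpy as np


def k_module_reassign(adjacency, labels, max_iter=100):
    """Move each gene to the module with the highest mean connectivity.

    adjacency: symmetric (n x n) matrix of connection strengths.
    labels:    initial module label per gene, e.g. from hierarchical
               clustering with dynamic tree cut.
    Returns the final labels and the number of iterations performed.
    """
    adj = np.array(adjacency, dtype=float)
    np.fill_diagonal(adj, 0.0)          # assumption: ignore self-connections
    labels = np.asarray(labels).copy()
    modules = np.unique(labels)         # the set of modules stays fixed

    for iteration in range(1, max_iter + 1):
        # Indicator matrix: member[i, k] is 1 if gene i is in module k.
        member = (labels[:, None] == modules[None, :]).astype(float)
        sums = adj @ member             # total connectivity gene -> module
        # Number of *other* genes in each module, seen from each gene.
        counts = member.sum(axis=0)[None, :] - member
        mean_conn = np.divide(sums, counts,
                              out=np.zeros_like(sums), where=counts > 0)
        new_labels = modules[np.argmax(mean_conn, axis=1)]
        if np.array_equal(new_labels, labels):
            return labels, iteration    # no gene moved: converged
        labels = new_labels
    return labels, max_iter
```

For example, with two tight blocks of three genes each and one gene initially placed in the wrong block, a single pass moves that gene back and the second pass confirms that nothing else changes. Because all genes are updated simultaneously, oscillation is possible in principle, which is why the sketch includes the iteration cap.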