4.7 Article

Network embedding framework for driver gene discovery by combining functional and structural information

期刊

BMC GENOMICS
卷 24, 期 1, 页码 -

出版社

BMC
DOI: 10.1186/s12864-023-09515-x

关键词

Driver gene; Gene interaction network; Network embedding; Mutation data; Classification algorithm

向作者/读者索取更多资源

Comprehensive analysis of multiple data sets is important for identifying potential driver genes in different cancers. In this study, a network embedding framework is proposed to combine functional and structural information for driver gene identification. By constructing a mutation integration network and extracting gene features using the struc2vec model, our method outperforms previous methods in identifying driver genes for 12 cancers in the cancer genome atlas.
Comprehensive analysis of multiple data sets can identify potential driver genes for various cancers. In recent years, driver gene discovery based on massive mutation data and gene interaction networks has attracted increasing attention, but there is still a need to explore combining functional and structural information of genes in protein interaction networks to identify driver genes. Therefore, we propose a network embedding framework combining functional and structural information to identify driver genes. Firstly, we combine the mutation data and gene interaction networks to construct mutation integration network using network propagation algorithm. Secondly, the struc2vec model is used for extracting gene features from the mutation integration network, which contains both gene's functional and structural information. Finally, machine learning algorithms are utilized to identify the driver genes. Compared with the previous four excellent methods, our method can find gene pairs that are distant from each other through structural similarities and has better performance in identifying driver genes for 12 cancers in the cancer genome atlas. At the same time, we also conduct a comparative analysis of three gene interaction networks, three gene standard sets, and five machine learning algorithms. Our framework provides a new perspective for feature selection to identify novel driver genes.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据