4.6 Article

Predicting Human Protein Subcellular Locations by Using a Combination of Network and Function Features

期刊

FRONTIERS IN GENETICS
卷 12, 期 -, 页码 -

出版社

FRONTIERS MEDIA SA
DOI: 10.3389/fgene.2021.783128

关键词

protein subcellular location; protein-protein interaction network; GO enrichment; KEGG enrichment; feature selection; classification algorithm

资金

  1. Strategic Priority Research Program of Chinese Academy of Sciences [XDA26040304, XDB38050200]
  2. National Key R&D Program of China [2018YFC0910403]
  3. Fund of the Key Laboratory of Tissue Microenvironment and Tumor of Chinese Academy of Sciences [202002]

向作者/读者索取更多资源

The study utilized a computational model to predict protein subcellular localizations based on protein-protein interaction networks and functional annotation information. Key proteins and quantitative rules were discovered to contribute to determining protein subcellular locations. This approach may help advance the development of predictive technologies on subcellular localizations and explore the biological significance of protein subcellular localization patterns.
Given the limitation of technologies, the subcellular localizations of proteins are difficult to identify. Predicting the subcellular localization and the intercellular distribution patterns of proteins in accordance with their specific biological roles, including validated functions, relationships with other proteins, and even their specific sequence characteristics, is necessary. The computational prediction of protein subcellular localizations can be performed on the basis of the sequence and the functional characteristics. In this study, the protein-protein interaction network, functional annotation of proteins and a group of direct proteins with known subcellular localization were used to construct models. To build efficient models, several powerful machine learning algorithms, including two feature selection methods, four classification algorithms, were employed. Some key proteins and functional terms were discovered, which may provide important contributions for determining protein subcellular locations. Furthermore, some quantitative rules were established to identify the potential subcellular localizations of proteins. As the first prediction model that uses direct protein annotation information (i.e., functional features) and STRING-based protein-protein interaction network (i.e., network features), our computational model can help promote the development of predictive technologies on subcellular localizations and provide a new approach for exploring the protein subcellular localization patterns and their potential biological importance.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.6
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据