4.7 Article

SIPGCN: A Novel Deep Learning Model for Predicting Self-Interacting Proteins from Sequence Information Using Graph Convolutional Networks

期刊

BIOMEDICINES
卷 10, 期 7, 页码 -

出版社

MDPI
DOI: 10.3390/biomedicines10071543

关键词

self-interacting protein; graph convolutional networks; protein-protein interactions; random forest

资金

  1. Major projects of the Ministry of science and technology [2021ZD0200403]
  2. National Natural Science Foundation of China [62172355, 61702444]
  3. Tianshan Youth, Excellent Youth [2019Q029]
  4. theWest Light Foundation of The Chinese Academy of Sciences [2018-XBQNXZ-B-008]
  5. Qingtan Scholar Talent Project of Zaozhuang University

向作者/读者索取更多资源

Protein is the fundamental organic substance in cells that plays a crucial role in biological activities. Self-interacting protein (SIP) is an important protein interaction. This study presents a SIP prediction method, SIPGCN, using a deep learning graph convolutional network (GCN). The results demonstrate excellent performance of SIPGCN.
Protein is the basic organic substance that constitutes the cell and is the material condition for the life activity and the guarantee of the biological function activity. Elucidating the interactions and functions of proteins is a central task in exploring the mysteries of life. As an important protein interaction, self-interacting protein (SIP) has a critical role. The fast growth of high-throughput experimental techniques among biomolecules has led to a massive influx of available SIP data. How to conduct scientific research using the massive amount of SIP data has become a new challenge that is being faced in related research fields such as biology and medicine. In this work, we design an SIP prediction method SIPGCN using a deep learning graph convolutional network (GCN) based on protein sequences. First, protein sequences are characterized using a position-specific scoring matrix, which is able to describe the biological evolutionary message, then their hidden features are extracted by the deep learning method GCN, and, finally, the random forest is utilized to predict whether there are interrelationships between proteins. In the cross-validation experiment, SIPGCN achieved 93.65% accuracy and 99.64% specificity in the human data set. SIPGCN achieved 90.69% and 99.08% of these two indicators in the yeast data set, respectively. Compared with other feature models and previous methods, SIPGCN showed excellent results. These outcomes suggest that SIPGCN may be a suitable instrument for predicting SIP and may be a reliable candidate for future wet experiments.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据