4.6 Article

An Ensemble Deep Learning based Predictor for Simultaneously Identifying Protein Ubiquitylation and SUMOylation Sites

期刊

BMC BIOINFORMATICS
卷 22, 期 1, 页码 -

出版社

BMC
DOI: 10.1186/s12859-021-04445-5

关键词

Protein ubiquitylation site; Protein SUMOylation site; Convolution neural network; Deep learning; Ensemble learning

资金

  1. National Natural Science Funds of China [61802057]
  2. Jilin Provincial Natural Science Foundation [20210101174JC, 20200201288JC]
  3. Fundamental Research Funds for the Central Universities, JLU [93K172020K22]
  4. science and technology research project of 13th FiveYear of the Education Department of Jilin province [JJKH20200330KJ]

向作者/读者索取更多资源

This study proposed a novel deep learning architecture for predicting protein Ubiquitylation and SUMOylation sites as well as their crosstalk sites simultaneously. The method achieved promising AUCs of 0.838, 0.888, and 0.862 on Ubiquitylation, SUMOylation, and crosstalk sites respectively in tenfold cross-validation. The results also demonstrated the effectiveness of the proposed architecture in classifying Ubiquitylated and SUMOylated lysine residues.
Background Several computational tools for predicting protein Ubiquitylation and SUMOylation sites have been proposed to study their regulatory roles in gene location, gene expression, and genome replication. However, existing methods generally rely on feature engineering, and ignore the natural similarity between the two types of protein translational modification. This study is the first all-in-one deep network to predict protein Ubiquitylation and SUMOylation sites from protein sequences as well as their crosstalk sites simultaneously. Our deep learning architecture integrates several meta classifiers that apply deep neural networks to protein sequence information and physico-chemical properties, which were trained on multi-label classification mode for simultaneously identifying protein Ubiquitylation and SUMOylation as well as their crosstalk sites. Results The promising AUCs of our method on Ubiquitylation, SUMOylation and crosstalk sites achieved 0.838, 0.888, and 0.862 respectively on tenfold cross-validation. The corresponding APs reached 0.683, 0.804 and 0.552, which also validated our effectiveness. Conclusions The proposed architecture managed to classify ubiquitylated and SUMOylated lysine residues along with their crosstalk sites, and outperformed other well-known Ubiquitylation and SUMOylation site prediction tools.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.6
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据