4.7 Article

BiGRUD-SA: Protein S-sulfenylation sites prediction based on BiGRU and self-attention

期刊

COMPUTERS IN BIOLOGY AND MEDICINE
卷 163, 期 -, 页码 -

出版社

PERGAMON-ELSEVIER SCIENCE LTD
DOI: 10.1016/j.compbiomed.2023.107145

关键词

S-sulfenylation sites; Multi-information fusion; SMOTE-Tomek; BiGRU; Self-attention mechanism

向作者/读者索取更多资源

S-sulfenylation is an important post-translational modification of proteins that has implications for signal transduction and protein function regulation. Predicting S-sulfenylation sites using computational methods is crucial for studying protein function and related biological mechanisms due to experimental limitations. In this paper, a method called BiGRUD-SA, based on BiGRU and self-attention mechanism, is proposed to predict protein S-sulfenylation sites.
S-sulfenylation is a vital post-translational modification (PTM) of proteins, which is an intermediate in other redox reactions and has implications for signal transduction and protein function regulation. However, there are many restrictions on the experimental identification of S-sulfenylation sites. Therefore, predicting S-sulfoylation sites by computational methods is fundamental to studying protein function and related biological mechanisms. In this paper, we propose a method named BiGRUD-SA based on bi-directional gated recurrent unit (BiGRU) and self-attention mechanism to predict protein S-sulfenylation sites. We first use AAC, BLOSUM62, AAindex, EAAC and GAAC to extract features, and do feature fusion to obtain original feature space. Next, we use SMOTE-Tomek method to handle data imbalance. Then, we input the processed data to the BiGRU and use self-attention mechanism to do further feature extraction. Finally, we input the data obtained to the deep neural networks (DNN) to identify S-sulfenylation sites. The accuracies of training set and independent test set are 96.66% and 95.91% respectively, which indicates that our method is conducive to identifying S-sulfenylation sites. Furthermore, we use a data set of S-sulfenylation sites in Arabidopsis thaliana to effectively verify the generalization ability of BiGRUD-SA method, and obtain better prediction results.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据