4.7 Article

BiGRUD-SA: Protein S-sulfenylation sites prediction based on BiGRU and self-attention

Journal

COMPUTERS IN BIOLOGY AND MEDICINE
Volume 163, Issue -, Pages -

Publisher

PERGAMON-ELSEVIER SCIENCE LTD
DOI: 10.1016/j.compbiomed.2023.107145

Keywords

S-sulfenylation sites; Multi-information fusion; SMOTE-Tomek; BiGRU; Self-attention mechanism

Ask authors/readers for more resources

S-sulfenylation is an important post-translational modification of proteins that has implications for signal transduction and protein function regulation. Predicting S-sulfenylation sites using computational methods is crucial for studying protein function and related biological mechanisms due to experimental limitations. In this paper, a method called BiGRUD-SA, based on BiGRU and self-attention mechanism, is proposed to predict protein S-sulfenylation sites.
S-sulfenylation is a vital post-translational modification (PTM) of proteins, which is an intermediate in other redox reactions and has implications for signal transduction and protein function regulation. However, there are many restrictions on the experimental identification of S-sulfenylation sites. Therefore, predicting S-sulfoylation sites by computational methods is fundamental to studying protein function and related biological mechanisms. In this paper, we propose a method named BiGRUD-SA based on bi-directional gated recurrent unit (BiGRU) and self-attention mechanism to predict protein S-sulfenylation sites. We first use AAC, BLOSUM62, AAindex, EAAC and GAAC to extract features, and do feature fusion to obtain original feature space. Next, we use SMOTE-Tomek method to handle data imbalance. Then, we input the processed data to the BiGRU and use self-attention mechanism to do further feature extraction. Finally, we input the data obtained to the deep neural networks (DNN) to identify S-sulfenylation sites. The accuracies of training set and independent test set are 96.66% and 95.91% respectively, which indicates that our method is conducive to identifying S-sulfenylation sites. Furthermore, we use a data set of S-sulfenylation sites in Arabidopsis thaliana to effectively verify the generalization ability of BiGRUD-SA method, and obtain better prediction results.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available