Journal
BRIEFINGS IN BIOINFORMATICS
Volume 23, Issue 5, Pages -Publisher
OXFORD UNIV PRESS
DOI: 10.1093/bib/bbac323
Keywords
protein S-sulfinylation; convolutional neural network; bidirectional LSTM; self-attention mechanism; residual connection
Funding
- Fundamental Research Funds for the Central Universities [3132022257]
Ask authors/readers for more resources
A multi-module deep learning framework named DLF-Sul is proposed for the prediction of S-sulfinylation sites in proteins, achieving good performance. This study is of great importance for understanding the role of S-sulfinylation in cell biology and human diseases.
Protein S-sulfinylation is an important posttranslational modification that regulates a variety of cell and protein functions. This modification has been linked to signal transduction, redox homeostasis and neuronal transmission in studies. Therefore, identification of S-sulfinylation sites is crucial to understanding its structure and function, which is critical in cell biology and human diseases. In this study, we propose a multi-module deep learning framework named DLF-Sul for identification of S-sulfinylation sites in proteins. First, three types of features are extracted including binary encoding, BLOSUM62 and amino acid index. Then, sequential features are further extracted based on these three types of features using bidirectional long short-term memory network. Next, multi-head self-attention mechanism is utilized to filter the effective attribute information, and residual connection helps to reduce information loss. Furthermore, convolutional neural network is employed to extract local deep features information. Finally, fully connected layers acts as classifier that map samples to corresponding label. Performance metrics on independent test set, including sensitivity, specificity, accuracy, Matthews correlation coefficient and area under curve, reach 91.80%, 92.36%, 92.08%, 0.8416 and 96.40%, respectively. The results show that DLF-Sul is an effective tool for predicting S-sulfinylation sites. The source code is available on the website https://github.com/ningq669/DLF-Sul.
Authors
I am an author on this paper
Click your name to claim this paper and add it to your profile.
Reviews
Recommended
No Data Available