4.7 Article

Predicting Structural Susceptibility of Proteins to Proteolytic Processing

Journal

Publisher

MDPI
DOI: 10.3390/ijms241310761

Keywords

regulatory proteolysis; proteases; protease substrates; substrate identification

Ask authors/readers for more resources

The importance of 3D protein structure in proteolytic processing is well known. We developed a method that utilizes the structural features of potential substrates to predict proteolytic sites, taking into consideration factors such as solvent accessibility, secondary structure, and temperature factor. By carefully curating the training set, selecting an appropriate machine learning method, and determining the optimal positive-to-negative class size ratio, we demonstrated that our method outperforms existing bioinformatics methods for predicting proteolytic sites. This method may also have applications in predicting other post-translational modifications.
The importance of 3D protein structure in proteolytic processing is well known. However, despite the plethora of existing methods for predicting proteolytic sites, only a few of them utilize the structural features of potential substrates as predictors. Moreover, to our knowledge, there is currently no method available for predicting the structural susceptibility of protein regions to proteolysis. We developed such a method using data from CutDB, a database that contains experimentally verified proteolytic events. For prediction, we utilized structural features that have been shown to influence proteolysis in earlier studies, such as solvent accessibility, secondary structure, and temperature factor. Additionally, we introduced new structural features, including length of protruded loops and flexibility of protein termini. To maximize the prediction quality of the method, we carefully curated the training set, selected an appropriate machine learning method, and sampled negative examples to determine the optimal positive-to-negative class size ratio. We demonstrated that combining our method with models of protease primary specificity can outperform existing bioinformatics methods for the prediction of proteolytic sites. We also discussed the possibility of utilizing this method for bioinformatics prediction of other post-translational modifications.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available