Journal
INTERNATIONAL JOURNAL OF MOLECULAR SCIENCES
Volume 24, Issue 13, Pages -Publisher
MDPI
DOI: 10.3390/ijms241310761
Keywords
regulatory proteolysis; proteases; protease substrates; substrate identification
Ask authors/readers for more resources
The importance of 3D protein structure in proteolytic processing is well known. We developed a method that utilizes the structural features of potential substrates to predict proteolytic sites, taking into consideration factors such as solvent accessibility, secondary structure, and temperature factor. By carefully curating the training set, selecting an appropriate machine learning method, and determining the optimal positive-to-negative class size ratio, we demonstrated that our method outperforms existing bioinformatics methods for predicting proteolytic sites. This method may also have applications in predicting other post-translational modifications.
The importance of 3D protein structure in proteolytic processing is well known. However, despite the plethora of existing methods for predicting proteolytic sites, only a few of them utilize the structural features of potential substrates as predictors. Moreover, to our knowledge, there is currently no method available for predicting the structural susceptibility of protein regions to proteolysis. We developed such a method using data from CutDB, a database that contains experimentally verified proteolytic events. For prediction, we utilized structural features that have been shown to influence proteolysis in earlier studies, such as solvent accessibility, secondary structure, and temperature factor. Additionally, we introduced new structural features, including length of protruded loops and flexibility of protein termini. To maximize the prediction quality of the method, we carefully curated the training set, selected an appropriate machine learning method, and sampled negative examples to determine the optimal positive-to-negative class size ratio. We demonstrated that combining our method with models of protease primary specificity can outperform existing bioinformatics methods for the prediction of proteolytic sites. We also discussed the possibility of utilizing this method for bioinformatics prediction of other post-translational modifications.
Authors
I am an author on this paper
Click your name to claim this paper and add it to your profile.
Reviews
Recommended
No Data Available