4.7 Article

ProTstab2 for Prediction of Protein Thermal Stabilities

Journal

Publisher

MDPI
DOI: 10.3390/ijms231810798

Keywords

protein cellular stability; stability prediction; protein property; machine learning predictor; artificial intelligence; gradient boosting

Funding

  1. Key Project of Natural Science Foundation of the Jiangsu Higher Education Institutions of China [20KJA520010]
  2. Collaborative Innovation Center of Novel Software Technology and Industrialization, Vetenskapsradet [2019-01403]
  3. Swedish Cancer Society [CAN 20 1350]
  4. Formas [2019-01403] Funding Source: Formas
  5. Forte [2019-01403] Funding Source: Forte
  6. Swedish Research Council [2019-01403] Funding Source: Swedish Research Council

Ask authors/readers for more resources

The stability of proteins plays a crucial role in various biological processes and applications. Experimental determination of protein stability has been challenging with limited data. In this study, a machine learning predictor, ProTstab2, was developed using limited proteolysis and mass spectrometry approaches to accurately predict the thermal stabilities of proteins. The ProTstab2 showed superior performance compared to previous methods, and it was applied to predict and compare the stabilities of proteins in human, mouse, and zebrafish proteomes. The tool is freely available for use.
The stability of proteins is an essential property that has several biological implications. Knowledge about protein stability is important in many ways, ranging from protein purification and structure determination to stability in cells and biotechnological applications. Experimental determination of thermal stabilities has been tedious and available data have been limited. The introduction of limited proteolysis and mass spectrometry approaches has facilitated more extensive cellular protein stability data production. We collected melting temperature information for 34,913 proteins and developed a machine learning predictor, ProTstab2, by utilizing a gradient boosting algorithm after testing seven algorithms. The method performance was assessed on a blind test data set and showed a Pearson correlation coefficient of 0.753 and root mean square error of 7.005. Comparison to previous methods indicated that ProTstab2 had superior performance. The method is fast, so it was applied to predict and compare the stabilities of all proteins in human, mouse, and zebrafish proteomes for which experimental data were not determined. The tool is freely available.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available