4.7 Review

Assessing the performance of computational predictors for estimating protein stability changes upon missense mutations

Journal

BRIEFINGS IN BIOINFORMATICS
Volume 22, Issue 6, Pages -

Publisher

OXFORD UNIV PRESS
DOI: 10.1093/bib/bbab184

Keywords

protein stability change; bioinformatics; machine learning; deep learning; feature engineering; predictors

Funding

  1. National Health and Medical Research Council of Australia (NHMRC) [1144652, 1127948, 1174405]
  2. Major Inter-Disciplinary Research (IDR) project - Monash University, Dept of Data Science and AI, Faculty of IT, Monash University
  3. Collaborative Research Program of Institute for Chemical Research, Kyoto University [2021-28, 201932, 2018-28]
  4. Victorian Government's OIS Program
  5. National Health and Medical Research Council of Australia [1174405, 1127948, 1144652] Funding Source: NHMRC

Ask authors/readers for more resources

This study provides a comprehensive overview of computational tools for predicting protein stability changes upon mutations, evaluating their performance on different datasets based on mutation location and type. Results indicate that predictor performance is influenced by mutation location and type, with varying performance under different conditions for different tools.
Understanding how a mutation might affect protein stability is of significant importance to protein engineering and for understanding protein evolution genetic diseases. While a number of computational tools have been developed to predict the effect of missense mutations on protein stability protein stability upon mutations, they are known to exhibit large biases imparted in part by the data used to train and evaluate them. Here, we provide a comprehensive overview of predictive tools, which has provided an evolving insight into the importance and relevance of features that can discern the effects of mutations on protein stability. A diverse selection of these freely available tools was benchmarked using a large mutation-level blind dataset of 1342 experimentally characterised mutations across 130 proteins from ThermoMutDB, a second test dataset encompassing 630 experimentally characterised mutations across 39 proteins from iStable2.0 and a third blind test dataset consisting of 268 mutations in 27 proteins from the newly published ProThermDB. The performance of the methods was further evaluated with respect to the site of mutation, type of mutant residue and by ranging the pH and temperature. Additionally, the classification performance was also evaluated by classifying the mutations as stabilizing (Delta Delta G >= 0) or destabilizing (Delta Delta G<0). The results reveal that the performance of the predictors is affected by the site of mutation and the type of mutant residue. Further, the results show very low performance for pH values 6-8 and temperature higher than 65 for all predictors except iStable2.0 on the S630 dataset. To illustrate how stability and structure change upon single point mutation, we considered four stabilizing, two destabilizing and two stabilizing mutations from two proteins, namely the toxin protein and bovine liver cytochrome. Overall, the results on S268, S630 and S1342 datasets show that the performance of the integrated predictors is better than the mechanistic or individual machine learning predictors. We expect that this paper will provide useful guidance for the design and development of next-generation bioinformatic tools for predicting protein stability changes upon mutations.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available