4.4 Article

Structure-based prediction of the effects of a missense variant on protein stability

期刊

AMINO ACIDS
卷 44, 期 3, 页码 847-855

出版社

SPRINGER WIEN
DOI: 10.1007/s00726-012-1407-7

关键词

Amino acid mutation; Physicochemical properties; Residue-residue contact energy; Support vector machine; Protein stability prediction

资金

  1. National Nature Science Foundation of China [31170795, 20872107]
  2. Specialized Research Fund for the Doctoral Program of Higher Education of China [20113201110015]
  3. Scientific Research Foundation for Returned Scholars, Ministry of Education of China
  4. International S&T Cooperation Program of Suzhou [SH201120]
  5. National 973 Programs of China [2010CB945600]
  6. K.C Wong education foundation, Hong Kong
  7. Competitive Research Funding of Tampere University Hospital
  8. Sigrid Juselius Foundation
  9. Biocenter Finland

向作者/读者索取更多资源

Predicting the effects of amino acid substitutions on protein stability provides invaluable information for protein design, the assignment of biological function, and for understanding disease-associated variations. To understand the effects of substitutions, computational models are preferred to time-consuming and expensive experimental methods. Several methods have been proposed for this task including machine learning-based approaches. However, models trained using limited data have performance problems and many model parameters tend to be over-fitted. To decrease the number of model parameters and to improve the generalization potential, we calculated the amino acid contact energy change for point variations using a structure-based coarse-grained model. Based on the structural properties including contact energy (CE) and further physicochemical properties of the amino acids as input features, we developed two support vector machine classifiers. M47 predicted the stability of variant proteins with an accuracy of 87 % and a Matthews correlation coefficient of 0.68 for a large dataset of 1925 variants, whereas M8 performed better when a relatively small dataset of 388 variants was used for 20-fold cross-validation. The performance of the M47 classifier on all six tested contingency table evaluation parameters is better than that of existing machine learning-based models or energy function-based protein stability classifiers.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.4
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据