☆ 4.2 Article

Predicting protein model correctness in Coot using machine learning

ACTA CRYSTALLOGRAPHICA SECTION D-STRUCTURAL BIOLOGY (2020)

期刊

ACTA CRYSTALLOGRAPHICA SECTION D-STRUCTURAL BIOLOGY

卷 76, 期 -, 页码 713-723

出版社

INT UNION CRYSTALLOGRAPHY

DOI: 10.1107/S2059798320009080

关键词

structure solution; software; model building; validation; machine learning; Coot

类别

Biochemical Research Methods Biochemistry & Molecular Biology Biophysics Crystallography

资金

White Rose BBSRC DTP in Mechanistic Biology [BB/M011151/1]
BBSRC [BB/S005099/1]
BBSRC [BB/S005099/1, 1792631] Funding Source: UKRI

向作者/读者索取更多资源

Protocol

社区支持

Reagent

社区支持

摘要

Manually identifying and correcting errors in protein models can be a slow process, but improvements in validation tools and automated model-building software can contribute to reducing this burden. This article presents a new correctness score that is produced by combining multiple sources of information using a neural network. The residues in 639 automatically built models were marked as correct or incorrect by comparing them with the coordinates deposited in the PDB. A number of features were also calculated for each residue usingCoot, including map-to-model correlation, density values,Bfactors, clashes, Ramachandran scores, rotamer scores and resolution. Two neural networks were created using these features as inputs: one to predict the correctness of main-chain atoms and the other for side chains. The 639 structures were split into 511 that were used to train the neural networks and 128 that were used to test performance. The predicted correctness scores could correctly categorize 92.3% of the main-chain atoms and 87.6% of the side chains. ACootML Correctness script was written to display the scores in a graphical user interface as well as for the automatic pruning of chains, residues and side chains with low scores. The automatic pruning function was added to theCCP4i2Buccaneerautomated model-building pipeline, leading to significant improvements, especially for high-resolution structures.

Predicting protein model correctness in Coot using machine learning

期刊

ACTA CRYSTALLOGRAPHICA SECTION D-STRUCTURAL BIOLOGY

出版社

INT UNION CRYSTALLOGRAPHY

关键词

类别

资金

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

Predicting protein model correctness in Coot using machine learning

期刊

ACTA CRYSTALLOGRAPHICA SECTION D-STRUCTURAL BIOLOGY

出版社

INT UNION CRYSTALLOGRAPHY

关键词

类别

资金

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

导出引文

分享论文