☆ 4.1 Article

Integration of Machine Learning Improves The Prediction Accuracy of Molecular Modelling for M. jannaschii Tyrosyl-tRNA Synthetase Substrate Specificity

PROGRESS IN BIOCHEMISTRY AND BIOPHYSICS (2021)

Journal

PROGRESS IN BIOCHEMISTRY AND BIOPHYSICS

Volume 48, Issue 10, Pages 1214-1232

Publisher

CHINESE ACAD SCIENCES, INST BIOPHYSICS

DOI: 10.16476/j.pibb.2020.0425

Keywords

tyrosyl-tRNA synthetase; genetic code expansion; enzyme substrate specificity; Rosetta; molecular modelling; machine learning

Categories

Biochemistry & Molecular Biology; Biophysics

Funding

National Natural Science Foundation of China [61431017]

Summary

Designing enzyme binding pockets to accommodate substrates with different chemical structures is a great challenge, traditionally requiring the screening of thousands to millions of mutants. By integrating molecular modeling with data-driven machine learning, mutant libraries with high enrichment ratios can be generated to accelerate the screening process. This integrated workflow is expected to significantly benefit Mj. TyrRS mutant screening and to reduce the time and cost of wet-lab experiments.

Abstract

Designing an enzyme binding pocket to accommodate substrates with different chemical structures is a great challenge. Traditionally, thousands or even millions of mutants have to be screened in wet-lab experiments to find a ligand-specific mutant, which consumes a large amount of time and resources. To accelerate the screening process, we propose a novel workflow that integrates molecular modeling with a data-driven machine learning method to generate mutant libraries with a high enrichment ratio for recognition of a specific substrate. M. jannaschii tyrosyl-tRNA synthetase (Mj. TyrRS) is used as an example because the sequences and structures of many unnatural-amino-acid-specific Mj. TyrRS mutants have been reported. We collected all the Mj. TyrRS mutants reported in the literature and compared the sequence and structural features of the mutants with those of wild-type Mj. TyrRS. Based on the crystal structures of different Mj. TyrRS mutants and the Rosetta modeling results, we found that the D158G/P mutation is critical: it influences backbone disruption of the helix spanning residues 158-163. Our results showed that, compared with random mutation, Rosetta modeling and score function calculation can elevate the enrichment ratio of desired mutants by 2-fold in a test library of 687 mutants, while after calibration by a machine learning model trained on known data for Mj. TyrRS mutants and ligands, the enrichment ratio can be elevated by 11-fold. This workflow, which integrates molecular modeling and machine learning, is anticipated to significantly benefit Mj. TyrRS mutant screening and to substantially reduce the time and cost of wet-lab experiments. In addition, this process is expected to have broad application in the field of computational protein design.
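The abstract reports enrichment ratios relative to random mutation but does not spell out the formula. As a minimal sketch, assuming the common definition (hit rate among the top-ranked mutants divided by the hit rate of the whole library) and assuming that a lower score is better, as with Rosetta-style energies, the calculation could look like the following. The function name `enrichment_ratio`, its parameters, and the toy data are hypothetical and are not taken from the paper.

```python
from typing import Sequence


def enrichment_ratio(scores: Sequence[float],
                     is_hit: Sequence[bool],
                     top_k: int) -> float:
    """Hit rate in the top_k best-scored mutants over the library-wide hit rate.

    Assumption: lower score means a better predicted mutant.
    A value of 1.0 corresponds to random selection.
    """
    if len(scores) != len(is_hit):
        raise ValueError("scores and is_hit must have the same length")
    if not 0 < top_k <= len(scores):
        raise ValueError("top_k must be between 1 and the library size")
    total_hits = sum(is_hit)
    if total_hits == 0:
        raise ValueError("library contains no hits; ratio is undefined")

    # Rank mutant indices from best (lowest) to worst (highest) score.
    ranked = sorted(range(len(scores)), key=lambda i: scores[i])
    selected_hits = sum(is_hit[i] for i in ranked[:top_k])

    selected_rate = selected_hits / top_k
    baseline_rate = total_hits / len(scores)
    return selected_rate / baseline_rate


# Toy library of 8 mutants with 2 known hits (illustrative values only).
toy_scores = [-5.0, -4.0, -3.0, -2.0, -1.0, 0.0, 1.0, 2.0]
toy_hits = [True, False, True, False, False, False, False, False]
print(enrichment_ratio(toy_scores, toy_hits, top_k=2))
```

Under this definition, the reported 2-fold and 11-fold figures would mean that the selected subset contains desired mutants at 2 and 11 times the rate expected from random picking; the paper may use a different selection size or normalization.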
