4.3 Article

In silico characterization of protein chimeras: Relating sequence and function within the same fold

期刊

出版社

WILEY
DOI: 10.1002/prot.22422

关键词

bioinformatics; protein design; DNA shuffling; protein structure; machine learning; kernel method

资金

  1. UQ-Enabling
  2. ARC Center of Excellence in Bioinformatics

向作者/读者索取更多资源

The exploration of novel proteins via recombination of fragments derived from structurally homologous proteins has enormous potential for medicine and biotechnology. This modular exchange of sequence material puts novel activities, substrate specificities, and stability within reach of a semi-random search. This article takes stock of the growing resource of experimentally characterized chimeric proteins within a homologous protein family to build sequence-function models that can effectively guide the construction of new libraries. A novel framework for predicting structural viability of chimeric proteins, only assuming knowledge of their sequence and their parental structure, is presented. Removing a major barrier in previous work, the model processes any sequence that derives from parents with similar folds. The method naturally mixes test and training data from site-directed recombination, DNA shuffling, or random mutagenesis experiments. We train a model from a site-directed recombination library with state-of-the-art prediction accuracy on hold-out test data from the same experimental source and convincing performance on chimeras with a different origin. Specifically, the model is used to assess the structural viability of P450 chimeras deriving from proteins with only 18% sequence similarity to those used for model timing.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.3
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据