4.5 Article

Model-driven design of synthetic N-terminal coding sequences for regulating gene expression in yeast and bacteria

期刊

BIOTECHNOLOGY JOURNAL
卷 17, 期 5, 页码 -

出版社

WILEY-V C H VERLAG GMBH
DOI: 10.1002/biot.202100655

关键词

Bacillus subtilis; biosynthesis pathway; N-terminal coding sequences; multi-view learning; Saccharomyces cerevisiae

资金

  1. National Key Research and Development Program of China [2020YFA0908300]
  2. National Natural Science Foundation of China [32172349]
  3. Natural Science Foundation of Jiangsu Province [BK20200085]
  4. Key Research and Development Program of Jiangsu Province [BE2019628]
  5. Fundamental Research Funds for the Central Universities [JUSRP22036]

向作者/读者索取更多资源

In this study, a multi-view learning strategy was used to generate synthetic NCSs for regulating gene expression in S. cerevisiae and B. subtilis. The synthetic NCSs improved bioproduction of specific compounds in these microorganisms.
N-terminal coding sequences (NCSs) are key regulatory elements for fine-tuning gene expression during translation initiation-the rate-limiting step of translation. However, owing to the complex combinatory effects of NCS biophysical factors and endogenous regulation, designing NCSs remains challenging. In this study, a multi-view learning strategy for model-driven generation of synthetic NCSs for Saccharomyces cerevisiae and Bacillus subtilis are implemented, which are widely used in laboratories and industries. NCS libraries for S. cerevisiae and B. subtilis with nearly 150,000 cells were sorted. Next, model training was performed with NCS deep features extracted from DNA, codon, and amino acid sequences, as well as calculated features from the minimum free energy (MFE) and tRNA adaption index. Two models were separately developed for generating synthetic NCSs for both up- and down-regulating gene expression with accuracies higher than 65% for S. cerevisiae and B. subtilis. Synthetic NCSs were then applied to enhance bioproduction, yielding 1.48- and 1.71-fold production improvements of D-limonene by S. cerevisiae and ovalbumin by B. subtilis, respectively. This work provides model-driven design of synthetic NCSs as a toolbox for regulating gene expression in S. cerevisiae and B. subtilis. The machine learning-based modeling approach can be used for NCS design in other microorganisms.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.5
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据