4.7 Article

A Computational Framework for Identifying Promoter Sequences in Nonmodel Organisms Using RNA-seq Data Sets

Journal

ACS SYNTHETIC BIOLOGY
Volume 10, Issue 6, Pages 1394-1405

Publisher

AMER CHEMICAL SOC
DOI: 10.1021/acssynbio.1c00017

Keywords

promoter prediction; metabolic engineering; M. buryatense; RNA-seq; XylE assay; synthetic biology

Funding

  1. NSF GRFP [DGE-1762114]
  2. University of Washington
  3. HDR: I-DIRSEFW: Accelerating the Engineering Design and Manufacturing Life-Cycle with Data Science [NSF #1934292]

Ask authors/readers for more resources

This study developed a computational framework to identify constitutively, strongly expressed genes and predict strong promoter signals using standard RNA-seq data sets. The framework was applied to methanotroph Methylotuvimicrobium buryatense 5GB1, identifying 25 genes with high expression levels across diverse experimental conditions. The predicted promoter motifs were experimentally validated and found to be biologically meaningful for engineering diverse microorganisms for biomolecule production.
Engineering microorganisms into biological factories that convert renewable feedstocks into valuable materials is a major goal of synthetic biology; however, for many nonmodel organisms, we do not yet have the genetic tools, such as suites of strong promoters, necessary to effectively engineer them. In this work, we developed a computational framework that can leverage standard RNA-seq data sets to identify sets of constitutive, strongly expressed genes and predict strong promoter signals within their upstream regions. The framework was applied to a diverse collection of RNA-seq data measured for the methanotroph Methylotuvimicrobium buryatense 5GB1 and identified 25 genes that were constitutively, strongly expressed across 12 experimental conditions. For each gene, the framework predicted short (27-30 nucleotide) sequences as candidate promoters and derived -35 and -10 consensus promoter motifs (TTGACA and TATAAT, respectively) for strong expression in M. buryatense. This consensus closely matches the canonical E. coli sigma-70 motif and was found to be enriched in promoter regions of the genome. A subset of promoter predictions was experimentally validated in a XylE reporter assay, including the consensus promoter, which showed high expression. The pmoC, pqqA, and ssrA promoter predictions were additionally screened in an experiment that scrambled the -35 and -10 signal sequences, confirming that transcription initiation was disrupted when these specific regions of the predicted sequence were altered. These results indicate that the computational framework can make biologically meaningful promoter predictions and identify key pieces of regulatory systems that can serve as foundational tools for engineering diverse microorganisms for biomolecule production.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available