4.8 Article

Simple and accurate transcriptional start site identification using Smar2C2 and examination of conserved promoter features

Journal

PLANT JOURNAL
Volume 112, Issue 2, Pages 583-596

Publisher

WILEY
DOI: 10.1111/tpj.15957

Keywords

transcription start site; promoter; cis-regulatory elements; template switching reverse transcriptase; rolling circle amplification; technical advance

Categories

Funding

  1. National Institute of General Medical Sciences of the National Institutes of Health [T32GM007103]
  2. National Science Foundation [IOS-1546867, IOS-1856627]
  3. Georgia Advanced Computing Resource Center
  4. Georgia Genomics & Bioinformatics Core

Ask authors/readers for more resources

The accurate identification and quantification of transcriptional start sites (TSSs) is crucial for understanding transcription control. In this study, the researchers developed Smar2C2, a new method that allows for the easy and efficient identification of TSSs and transcription termination sites. Using this method, they were able to identify TSSs in multiple plant species and discover evolutionarily conserved features as well as sequence variations in known promoter motifs that may have significant implications for our understanding and control of transcription initiation.
The precise and accurate identification and quantification of transcriptional start sites (TSSs) is key to understanding the control of transcription. The core promoter consists of the TSS and proximal non-coding sequences, which are critical in transcriptional regulation. Therefore, the accurate identification of TSSs is important for understanding the molecular regulation of transcription. Existing protocols for TSS identification are challenging and expensive, leaving high-quality data available for a small subset of organisms. This sparsity of data impairs study of TSS usage across tissues or in an evolutionary context. To address these shortcomings, we developed Smart-Seq2 Rolling Circle to Concatemeric Consensus (Smar2C2), which identifies and quantifies TSSs and transcription termination sites. Smar2C2 incorporates unique molecular identifiers that allowed for the identification of as many as 70 million sites, with no known upper limit. We have also generated TSS data sets from as little as 40 pg of total RNA, which was the smallest input tested. In this study, we used Smar2C2 to identify TSSs in Glycine max (soybean), Oryza sativa (rice), Sorghum bicolor (sorghum), Triticum aestivum (wheat) and Zea mays (maize) across multiple tissues. This wide panel of plant TSSs facilitated the identification of evolutionarily conserved features, such as novel patterns in the dinucleotides that compose the initiator element (Inr), that correlated with promoter expression levels across all species examined. We also discovered sequence variations in known promoter motifs that are positioned reliably close to the TSS, such as differences in the TATA box and in the Inr that may prove significant to our understanding and control of transcription initiation. Smar2C2 allows for the easy study of these critical sequences, providing a tool to facilitate discovery.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.8
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available