4.8 Article

Characteristics and Significance of Intergenic Polyadenylated RNA Transcription in Arabidopsis

Journal

PLANT PHYSIOLOGY
Volume 161, Issue 1, Pages 210-224

Publisher

AMER SOC PLANT BIOLOGISTS
DOI: 10.1104/pp.112.205245

Keywords

-

Categories

Funding

  1. National Science Foundation [DEB-0919452, IOS-1126998, MCB-0929100]
  2. Direct For Biological Sciences
  3. Division Of Integrative Organismal Systems [1126998] Funding Source: National Science Foundation
  4. Division Of Environmental Biology
  5. Direct For Biological Sciences [0919452] Funding Source: National Science Foundation
  6. Div Of Molecular and Cellular Bioscience
  7. Direct For Biological Sciences [1119778] Funding Source: National Science Foundation

Ask authors/readers for more resources

The Arabidopsis (Arabidopsis thaliana) genome is the most well-annotated plant genome. However, transcriptome sequencing in Arabidopsis continues to suggest the presence of polyadenylated (polyA) transcripts originating from presumed intergenic regions. It is not clear whether these transcripts represent novel noncoding or protein-coding genes. To understand the nature of intergenic polyA transcription, we first assessed its abundance using multiple messenger RNA sequencing data sets. We found 6,545 intergenic transcribed fragments (ITFs) occupying 3.6% of Arabidopsis intergenic space. In contrast to transcribed fragments that map to protein-coding and RNA genes, most ITFs are significantly shorter, are expressed at significantly lower levels, and tend to be more data set specific. A surprisingly large number of ITFs (32.1%) may be protein coding based on evidence of translation. However, our results indicate that these translated ITFs tend to be close to and are likely associated with known genes. To investigate if ITFs are under selection and are functional, we assessed ITF conservation through cross-species as well as within-species comparisons. Our analysis reveals that 237 ITFs, including 49 with translation evidence, are under strong selective constraint and relatively distant from annotated features. These ITFs are likely parts of novel genes. However, the selective pressure imposed on most ITFs is similar to that of randomly selected, untranscribed intergenic sequences. Our findings indicate that despite the prevalence of ITFs, apart from the possibility of genomic contamination, many may be background or noisy transcripts derived from junk DNA, whose production may be inherent to the process of transcription and which, on rare occasions, may act as catalysts for the creation of novel genes.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.8
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available