4.6 Article

RNA Secondary Structures with Limited Base Pair Span: Exact Backtracking and an Application

Journal

GENES
Volume 12, Issue 1, Pages -

Publisher

MDPI
DOI: 10.3390/genes12010014

Keywords

RNA secondary structure prediction; scanning algorithm; hyper-stable RNA elements

Funding

  1. German Federal Ministry for Education and Research [BMBF 031A538B]

Ask authors/readers for more resources

The accuracy of RNA secondary structure prediction is affected by the span of base pairs, and algorithms can be specialized to optimize memory usage, but this may sacrifice finding the globally optimal structure. An efficient backtracking algorithm has been integrated into the ViennaRNA package for reconstructing the globally optimal structure, while constraining the z-scores can help identify hyper-stable structural elements in genomic sequences.
The accuracy of RNA secondary structure prediction decreases with the span of a base pair, i.e., the number of nucleotides that it encloses. The dynamic programming algorithms for RNA folding can be easily specialized in order to consider only base pairs with a limited span L, reducing the memory requirements to O(nL), and further to O(n) by interleaving backtracking. However, the latter is an approximation that precludes the retrieval of the globally optimal structure. So far, the ViennaRNA package therefore does not provide a tool for computing optimal, span-restricted minimum energy structure. Here, we report on an efficient backtracking algorithm that reconstructs the globally optimal structure from the locally optimal fragments that are produced by the interleaved backtracking implemented in RNALfold. An implementation is integrated into the ViennaRNA package. The forward and the backtracking recursions of RNALfold are both easily constrained to structural components with a sufficiently negative z-scores. This provides a convenient method in order to identify hyper-stable structural elements. A screen of the C. elegans genome shows that such features are more abundant in real genomic sequences when compared to a di-nucleotide shuffled background model.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.6
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available