4.8 Article

QRNAstruct: a method for extracting secondary structural features of RNA via regression with biological activity

Journal

NUCLEIC ACIDS RESEARCH
Volume 50, Issue 13, Pages -

Publisher

OXFORD UNIV PRESS
DOI: 10.1093/nar/gkac220

Keywords

-

Funding

  1. Japan Society for the Promotion of Science [JP21K15075]
  2. Japan Science and Technology Agency (JST) CREST [JPMJCR18S1]

Ask authors/readers for more resources

This paper proposes a method for extracting secondary structure features that affect the functional activity of RNA from sequence-activity data. The method is applied to different experimental systems and provides detailed insights into the structure-activity relationships of RNA.
Recent technological advances have enabled the generation of large amounts of data consisting of RNA sequences and their functional activity. Here, we propose a method for extracting secondary structure features that affect the functional activity of RNA from sequence-activity data. Given pairs of RNA sequences and their corresponding bioactivity values, our method calculates position-specific structural features of the input RNA sequences, considering every possible secondary structure of each RNA. A Ridge regression model is trained using the structural features as feature vectors and the bioactivity values as response variables. Optimized model parameters indicate how secondary structure features affect bioactivity. We used our method to extract intramolecular structural features of bacterial translation initiation sites and self-cleaving ribozymes, and the intermolecular features between rRNAs and Shine-Dalgarno sequences and between U1 RNAs and splicing sites. We not only identified known structural features but also revealed more detailed insights into structure-activity relationships than previously reported. Importantly, the datasets we analyzed here were obtained from different experimental systems and differed in size, sequence length and similarity, and number of RNA molecules involved, demonstrating that our method is applicable to various types of data consisting of RNA sequences and bioactivity values.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.8
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available