4.5 Article

A Fiedler Vector Scoring Approach for Novel RNA Motif Selection

Journal

JOURNAL OF PHYSICAL CHEMISTRY B
Volume 125, Issue 4, Pages 1144-1155

Publisher

AMER CHEMICAL SOC
DOI: 10.1021/acs.jpcb.0c10685

Keywords

-

Funding

  1. National Institute of General Medical Sciences, National Institutes of Health (NIH) [R35GM122562]
  2. RAPID Award from the National Science Foundation, Division of Mathematical Sciences [2030377]
  3. Philip Morris USA Inc.
  4. Philip Morris International

Ask authors/readers for more resources

Novel RNA motif design is crucial for technology and medicine, with computational methods playing an increasingly important role. By clustering RNA graphs and utilizing new design features, the accuracy and efficiency of RNA design can be significantly improved.
Novel RNA motif design is of great practical importance for technology and medicine. Increasingly, computational design plays an important role in such efforts. Our coarse-grained RAG (RNA-As-Graphs) framework offers strategies for enumerating the universe of RNA 2D folds, selecting RNA-like candidates for design, and determining sequences that fold onto these candidates. In RAG, RNA secondary structures are represented as tree or dual graphs. Graphs with known RNA structures are called existing, and the others are labeled hypothetical. By using simplified features for RNA graphs, we have clustered the hypothetical graphs into RNA-like and non-RNA-like groups and proposed RNA-like graphs as candidates for design. Here, we propose a new way of designing graph features by using Fiedler vectors. The new features reflect graph shapes better, and they lead to a more clustered organization of existing graphs. We show significant increases in K-means clustering accuracy by using the new features (e.g., up to 95% and 98% accuracy for tree and dual graphs, respectively). In addition, we propose a scoring model for top graph candidate selection. This scoring model allows users to set a threshold for candidates, and it incorporates weighing of existing graphs based on their corresponding number of known RNAs. We include a list of top scored RNA-like candidates, which we hope will stimulate future novel RNA design.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.5
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available