4.4 Article

Discovering pathways through ribozyme fitness landscapes using information theoretic quantification of epistasis

Journal

RNA
Volume 29, Issue 11, Pages 1644-1657

Publisher

COLD SPRING HARBOR LAB PRESS, PUBLICATIONS DEPT
DOI: 10.1261/rna.079541.122

Keywords

ribozyme; fitness landscape; epistasis; mutual information; surprisal

Ask authors/readers for more resources

The identification of catalytic RNAs through experimental means is limited by the available sequence space. In this study, the researchers propose a method to predict additional ribozyme sequences by extrapolating from a limited data set, using information theory and a simple model of the epistatic fitness landscape. The predictions were experimentally validated, confirming the accuracy of the approach and discovering novel evolutionary links between ribozyme families.
The identification of catalytic RNAs is typically achieved through primarily experimental means. However, only a small fraction of sequence space can be analyzed even with high-throughput techniques. Methods to extrapolate from a limited data set to predict additional ribozyme sequences, particularly in a human-interpretable fashion, could be useful both for designing new functional RNAs and for generating greater understanding about a ribozyme fitness landscape. Using information theory, we express the effects of epistasis (i.e., deviations from additivity) on a ribozyme. This representation was incorporated into a simple model of the epistatic fitness landscape, which identified potentially exploitable combinations of mutations. We used this model to theoretically predict mutants of high activity for a self-aminoacylating ribozyme, identifying potentially active triple and quadruple mutants beyond the experimental data set of single and double mutants. The predictions were validated experimentally, with nine out of nine sequences being accurately predicted to have high activity. This set of sequences included mutants that form a previously unknown evolutionary bridge between two ribozyme families that share a common motif. Individual steps in the method could be examined, understood, and guided by a human, combining interpretability and performance in a simple model to predict ribozyme sequences by extrapolation.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.4
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available