4.7 Article

AMaLa: Analysis of Directed Evolution Experiments via Annealed Mutational Approximated Landscape

Journal

Publisher

MDPI
DOI: 10.3390/ijms222010908

Keywords

computational biology; statistical modeling; fitness landscape; Directed Evolution; Deep Mutational Scanning; direct-coupling analysis

Ask authors/readers for more resources

AMaLa is a new method for inferring fitness landscapes from Directed Evolution experiments sequencing data, effectively leveraging the information encoded in the entire time evolution and using a time-dependent statistical weight with two contributions to gauge possible trajectories in sequence space. This approach accurately describes Directed Evolution dynamics and infers a fitness landscape that reproduces measures of the phenotype under selection, outperforming widely used inference strategies. The reliability of AMaLa is assessed by demonstrating how the inferred statistical model can predict relevant structural properties of the wild-type sequence.
We present Annealed Mutational approximated Landscape (AMaLa), a new method to infer fitness landscapes from Directed Evolution experiments sequencing data. Such experiments typically start from a single wild-type sequence, which undergoes Darwinian in vitro evolution via multiple rounds of mutation and selection for a target phenotype. In the last years, Directed Evolution is emerging as a powerful instrument to probe fitness landscapes under controlled experimental conditions and as a relevant testing ground to develop accurate statistical models and inference algorithms (thanks to high-throughput screening and sequencing). Fitness landscape modeling either uses the enrichment of variants abundances as input, thus requiring the observation of the same variants at different rounds or assuming the last sequenced round as being sampled from an equilibrium distribution. AMaLa aims at effectively leveraging the information encoded in the whole time evolution. To do so, while assuming statistical sampling independence between sequenced rounds, the possible trajectories in sequence space are gauged with a time-dependent statistical weight consisting of two contributions: (i) an energy term accounting for the selection process and (ii) a generalized Jukes-Cantor model for the purely mutational step. This simple scheme enables accurately describing the Directed Evolution dynamics and inferring a fitness landscape that correctly reproduces the measures of the phenotype under selection (e.g., antibiotic drug resistance), notably outperforming widely used inference strategies. In addition, we assess the reliability of AMaLa by showing how the inferred statistical model could be used to predict relevant structural properties of the wild-type sequence.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available