4.5 Article

Predicting Gene Expression Divergence between Single-Copy Orthologs in Two Species

Journal

GENOME BIOLOGY AND EVOLUTION
Volume 15, Issue 5, Pages -

Publisher

OXFORD UNIV PRESS
DOI: 10.1093/gbe/evad078

Keywords

gene expression; expression divergence; Ornstein-Uhlenbeck; machine learning; neural network

Ask authors/readers for more resources

This study presents PiXi, the first machine learning framework for predicting gene expression divergence between single-copy orthologs. PiXi models gene expression evolution as an Ornstein-Uhlenbeck process and combines it with multi-layer neural network, random forest, and support vector machine architectures for predictions. PiXi achieves high accuracy and precision in predicting gene expression divergence and estimating expression optima in two species.
Predicting gene expression divergence is integral to understanding the emergence of new biological functions and associated traits. Whereas several sophisticated methods have been developed for this task, their applications are either limited to duplicate genes or require expression data from more than two species. Thus, here we present PredIcting eXpression dIvergence (PiXi), the first machine learning framework for predicting gene expression divergence between single-copy orthologs in two species. PiXi models gene expression evolution as an Ornstein-Uhlenbeck process, and overlays this model with multi-layer neural network (NN), random forest, and support vector machine architectures for making predictions. It outputs the predicted class conserved or diverged for each pair of orthologs, as well as their predicted expression optima in the two species. We show that PiXi has high power and accuracy in predicting gene expression divergence between single-copy orthologs, as well as high accuracy and precision in estimating their expression optima in the two species, across a wide range of evolutionary scenarios, with the globally best performance achieved by a multi-layer NN. Moreover, application of our best-performing PiXi predictor to empirical gene expression data from single-copy orthologs residing at different loci in two species of Drosophila reveals that approximately 23% underwent expression divergence after positional relocation. Further analysis shows that several of these diverged genes are involved in the electron transport chain of the mitochondrial membrane, suggesting that new chromatin environments may impact energy production in Drosophila. Thus, by providing a toolkit for predicting gene expression divergence between single-copy orthologs in two species, PiXi can shed light on the origins of novel phenotypes across diverse biological processes and study systems.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.5
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available