4.7 Article

Robust analysis of prokaryotic pangenome gene gain and loss rates with Panstripe

Journal

GENOME RESEARCH
Volume 33, Issue 1, Pages 129-140

Publisher

COLD SPRING HARBOR LAB PRESS, PUBLICATIONS DEPT
DOI: 10.1101/gr.277340.122

Keywords

-

Ask authors/readers for more resources

Horizontal gene transfer (HGT) is important for the evolution and diversification of microbial species. Existing methods for analyzing gene presence/absence patterns do not consider errors in annotation and clustering. The new method Panstripe, based on generalized linear regression, can effectively identify differences in HGT events by accounting for population structure and errors in gene prediction.
Horizontal gene transfer (HGT) plays a critical role in the evolution and diversification of many microbial species. The resulting dynamics of gene gain and loss can have important implications for the development of antibiotic resistance and the design of vaccine and drug interventions. Methods for the analysis of gene presence/absence patterns typically do not account for errors introduced in the automated annotation and clustering of gene sequences. In particular, methods adapted from ecological studies, including the pangenome gene accumulation curve, can be misleading as they may reflect the underlying diversity in the temporal sampling of genomes rather than a difference in the dynamics of HGT. Here, we introduce Panstripe, a method based on generalized linear regression that is robust to population structure, sampling bias, and errors in the predicted presence/absence of genes. We show using simulations that Panstripe can effectively identify differences in the rate and number of genes involved in HGT events, and illustrate its capability by analyzing several diverse bacterial genome data sets representing major human pathogens.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available