Article

THE DIVERSITY OF A DISTRIBUTED GENOME IN BACTERIAL POPULATIONS

Journal

ANNALS OF APPLIED PROBABILITY
Volume 20, Issue 5, Pages 1567-1606

Publisher

INST MATHEMATICAL STATISTICS
DOI: 10.1214/09-AAP657

Keywords

Kingman's coalescent; infinitely many genes model; infinitely many sites model; gene content

Funding

  1. BMBF [0313921]

Abstract

The distributed genome hypothesis states that the set of genes in a population of bacteria is distributed over all individuals that belong to the specific taxon. It implies that certain genes can be gained and lost from generation to generation. We use the random genealogy given by a Kingman coalescent in order to superimpose events of gene gain and loss along ancestral lines. Gene gains occur at a constant rate along ancestral lines. We assume that gained genes have never been present in the population before. Gene losses occur at a rate proportional to the number of genes present along the ancestral line. In this infinitely many genes model we derive moments for several statistics within a sample: the average number of genes per individual, the average number of genes differing between individuals, the number of incongruent pairs of genes, the total number of different genes in the sample and the gene frequency spectrum. We demonstrate that the model gives a reasonable fit with gene frequency data from marine cyanobacteria.
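The model described above can be illustrated with a small simulation. The following is a minimal sketch, not the authors' code, and it rests on several assumptions that the text does not state: time is scaled so that each pair of lineages coalesces at rate 1, genes are gained at rate `theta / 2` along each line, and each gene present is lost at rate `rho / 2`. The names `theta`, `rho`, `simulate_gene_content` and `summarize` are hypothetical. The root genome is drawn from the stationary Poisson distribution of this gain-loss process, and the "genes differing between individuals" statistic is taken here to be the size of the symmetric difference, which may differ from the paper's exact definition.

```python
import itertools
import math
import random
from collections import Counter


def poisson(rng, mean):
    """Draw a Poisson(mean) variate by counting unit-rate arrivals in [0, mean]."""
    count = 0
    total = rng.expovariate(1.0)
    while total < mean:
        count += 1
        total += rng.expovariate(1.0)
    return count


def simulate_gene_content(n, theta, rho, rng):
    """Return n gene sets sampled under the assumed gain/loss parameterization."""
    # Step 1: build a Kingman coalescent tree backward in time.
    heights = [0.0] * n            # node id -> height; leaves sit at height 0
    children = {}                  # internal node id -> (child_a, child_b)
    active = list(range(n))
    t = 0.0
    while len(active) > 1:
        k = len(active)
        t += rng.expovariate(k * (k - 1) / 2)   # total pair-coalescence rate
        a, b = rng.sample(active, 2)
        active.remove(a)
        active.remove(b)
        node = len(heights)
        heights.append(t)
        children[node] = (a, b)
        active.append(node)
    root = active[0]

    # Step 2: superimpose gene gain and loss from the root down to the leaves.
    gene_ids = itertools.count()   # every gained gene is new to the population
    root_genes = {next(gene_ids) for _ in range(poisson(rng, theta / rho))}
    genomes = [None] * n
    stack = [(root, root_genes)]
    while stack:
        node, genes = stack.pop()
        if node < n:               # leaf: record the sampled individual's genes
            genomes[node] = genes
            continue
        for child in children[node]:
            length = heights[node] - heights[child]
            keep = math.exp(-rho * length / 2)  # survival prob. of one gene
            inherited = {g for g in genes if rng.random() < keep}
            # Genes gained on this branch that are still present at its end.
            gained = poisson(rng, theta / rho * (1 - keep))
            inherited.update(next(gene_ids) for _ in range(gained))
            stack.append((child, inherited))
    return genomes


def summarize(genomes):
    """Compute a few of the sample statistics named in the abstract."""
    n = len(genomes)
    carriers = Counter(g for genome in genomes for g in genome)
    pairs = list(itertools.combinations(genomes, 2))
    return {
        "avg_genes": sum(len(g) for g in genomes) / n,
        "avg_pairwise_diff": (
            sum(len(a ^ b) for a, b in pairs) / len(pairs) if pairs else 0.0
        ),
        "total_genes": len(carriers),
        # spectrum[k] = number of genes carried by exactly k individuals
        "spectrum": Counter(carriers.values()),
    }


if __name__ == "__main__":
    sample = simulate_gene_content(10, theta=20.0, rho=2.0, rng=random.Random(1))
    print(summarize(sample))
```

Averaging the output of `summarize` over many independent runs gives Monte Carlo estimates that can be compared with analytically derived moments. The sketch omits the number of incongruent pairs of genes, since its precise definition is not given in the abstract.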
