4.6 Article

Bayesian inference of phylogenetic networks from bi-allelic genetic markers

Journal

PLOS COMPUTATIONAL BIOLOGY
Volume 14, Issue 1, Pages -

Publisher

PUBLIC LIBRARY SCIENCE
DOI: 10.1371/journal.pcbi.1005932

Keywords

-

Funding

  1. National Science Foundation [DBI-1355998, CCF-1302179]
  2. Data Analysis and Visualization Cyberinfrastructure - NSF [OCI-0959097]
  3. Rice University
  4. Big-Data Private-Cloud Research Cyberinfrastructure MRI-award - NSF [CNS-1338099]
  5. Direct For Biological Sciences
  6. Div Of Biological Infrastructure [1355998] Funding Source: National Science Foundation
  7. Division of Computing and Communication Foundations
  8. Direct For Computer & Info Scie & Enginr [1514177] Funding Source: National Science Foundation

Ask authors/readers for more resources

Phylogenetic networks are rooted, directed, acyclic graphs that model reticulate evolutionary histories. Recently, statistical methods were devised for inferring such networks from either gene tree estimates or the sequence alignments of multiple unlinked loci. Bi-allelic markers, most notably single nucleotide polymorphisms (SNPs) and amplified fragment length polymorphisms (AFLPs), provide a powerful source of genome-wide data. In a recent paper, a method called SNAPP was introduced for statistical inference of species trees from unlinked bi-allelic markers. The generative process assumed by the method combined both a model of evolution for the bi-allelic markers, as well as the multispecies coalescent. A novel component of the method was a polynomial-time algorithm for exact computation of the likelihood of a fixed species tree via integration over all possible gene trees for a given marker. Here we report on a method for Bayesian inference of phylogenetic networks from bi-allelic markers. Our method significantly extends the algorithm for exact computation of phylogenetic network likelihood via integration over all possible gene trees. Unlike the case of species trees, the algorithm is no longer polynomial-time on all instances of phylogenetic networks. Furthermore, the method utilizes a reversible-jump MCMC technique to sample the posterior of phylogenetic networks given bi-allelic marker data. Our method has a very good performance in terms of accuracy and robustness as we demonstrate on simulated data, as well as a data set of multiple New Zealand species of the plant genus Ourisia (Plantaginaceae). We implemented the method in the publicly available, open-source PhyloNet software package.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.6
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available