4.5 Article

Gene tree quality affects empirical coalescent branch length estimation

Journal

ZOOLOGICA SCRIPTA
Volume 51, Issue 1, Pages 1-13

Publisher

WILEY
DOI: 10.1111/zsc.12512

Keywords

coalescent branches; exons; gene trees; locus informativeness; polytomy; ultraconserved elements

Funding

  1. Division of Environmental Biology [1118823, 1655683]
  2. Division Of Environmental Biology
  3. Direct For Biological Sciences [1655683] Funding Source: National Science Foundation
  4. Division Of Environmental Biology
  5. Direct For Biological Sciences [1118823] Funding Source: National Science Foundation

Ask authors/readers for more resources

The study found that gene tree error can affect the estimation of coalescent branch lengths, with low informative loci leading to significantly shorter CBLs. Gene tree resolution has a greater impact on UCE datasets, while it does not seem to affect exon datasets as much.
Assessing effects of gene tree error in coalescent analyses have widely ignored coalescent branch lengths (CBLs) despite their potential utility in estimating ancestral population demographics and detecting species tree anomaly zones. However, the ability of coalescent methods to obtain accurate estimates remains largely unexplored. Errors in gene trees should lead to underestimates of the true CBL, and for a given set of comparisons, longer CBLs should be more accurate. Here, we furthered our empirical understanding of how error in gene tree quality (i.e., locus informativeness and gene tree resolution) affect CBLs using four datasets comprised of ultraconserved elements (UCE) or exons for clades that exhibit wide ranges of branch lengths. For each dataset, we compared the impact of locus informativeness (assessed using number of parsimony-informative sites) and gene tree resolution on CBL estimates. Our results, in general, showed that CBLs were drastically shorter when estimates included low informative loci. Gene tree resolution also had an impact on UCE datasets, with polytomous gene trees producing longer branches than randomly resolved gene trees. However, resolution did not appear to affect CBL estimates from the more informative exon datasets. Thus, as expected, gene tree quality affects CBL estimates, though this can generally be minimized by using moderate filtering to select more informative loci and/or by allowing polytomies in gene trees. These approaches, as well as additional contributions to improve CBL estimation, should lead to CBLs that are useful for addressing evolutionary and biological questions.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.5
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available