4.7 Article

Effects of missing data and data type on phylotranscriptomic analysis of stony corals (Cnidaria: Anthozoa: Scleractinia)

Journal

MOLECULAR PHYLOGENETICS AND EVOLUTION
Volume 134, Issue -, Pages 12-23

Publisher

ACADEMIC PRESS INC ELSEVIER SCIENCE
DOI: 10.1016/j.ympev.2019.01.012

Keywords

-

Funding

  1. National Research Foundation, Prime Minister's Office, Singapore under its Marine Science R&D Programme [MSRDP-P03]

Across the tree of life, phylogenetic analysis is increasingly being performed using transcriptome data. As a result of heterogeneous gene expression within individual organisms and unequal sequencing depth between samples, coverage of homologous loci in such datasets is typically inhomogeneous. Consequently, missing data are a common feature of phylotranscriptomic inference, but their impact on phylogenetic analysis remains poorly characterised empirically. Considering the complexity of the evolutionary history of stony corals (Cnidaria: Anthozoa: Scleractinia), transcriptome data hold great promise for resolving their phylogeny, particularly if there is a good understanding of missing data and data type (either amino acid or DNA) effects. Here, we reconstructed a broad phylogenetic tree of 39 scleractinian species with 3 corallimorpharians as outgroups, including 15 transcriptomes that were newly sequenced and assembled in this study. Between 63 and 505 loci were used to analyse the scleractinian phylogeny, and we quantified differences in tree topology, tree shape, bootstrap support and effects of conflicting gene trees among datasets of varying completeness for both amino acid and DNA sequences. Even with almost 70% missing data, tree topologies appear to be mostly unaffected, although there are higher incongruence levels in the less complete datasets. Furthermore, DNA trees outperform amino acid trees in bootstrap support and robustness against incongruent loci. Overall, our findings indicate that high levels of missing data can still produce expected tree topologies, but identifying and omitting incongruent loci can lead to more consistent branch length estimates.
