期刊
MOLECULAR BIOLOGY AND EVOLUTION
卷 30, 期 8, 页码 1987-1997出版社
OXFORD UNIV PRESS
DOI: 10.1093/molbev/mst100
关键词
duplication; gene family; adaptive evolution
资金
- National Evolutionary Synthesis Center postdoctoral fellowship
- Ford Foundation pre-doctoral fellowship
- National Science Foundation grant [DBI-0845494]
- Direct For Biological Sciences
- Div Of Biological Infrastructure [0845494] Funding Source: National Science Foundation
Current sequencing methods produce large amounts of data, but genome assemblies constructed from these data are often fragmented and incomplete. Incomplete and error-filled assemblies result in many annotation errors, especially in the number of genes present in a genome. This means that methods attempting to estimate rates of gene duplication and loss often will be misled by such errors and that rates of gene family evolution will be consistently overestimated. Here, we present a method that takes these errors into account, allowing one to accurately infer rates of gene gain and loss among genomes even with low assembly and annotation quality. The method is implemented in the newest version of the software package CAFE, along with several other novel features. We demonstrate the accuracy of the method with extensive simulations and reanalyze several previously published data sets. Our results show that errors in genome annotation do lead to higher inferred rates of gene gain and loss but that CAFE 3 sufficiently accounts for these errors to provide accurate estimates of important evolutionary parameters.
作者
我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。
推荐
暂无数据