期刊
GENETICS
卷 179, 期 3, 页码 1409-1424出版社
GENETICS SOCIETY AMERICA
DOI: 10.1534/genetics.107.082198
关键词
-
Many data sets one could use for population genetics contain artifactual sites, i.e., sequencing errors. Here we first explore the impact of such errors on several common summary statistics, assuming that sequencing errors are mostly singletons. We thus show that in the presence of those errors, estimators of theta can be strongly based. We further show that even with a moderate number of sequencing errors, neutrality tests based on the frequecncy spectrum reject neutrality. This implies that analyses of data sets with such errors will systematically lead to wrong inferences of evolutionary scenarios. To avoid to these errors, we propose two new estimators of theta that ignore singletons as well as two new tests Y and Y* that can be used to test neutrality despite sequencing errors. All in all, we show that even though singletons are ignored, these new tests show some power to detect deviations from a standard neutral model. We therefore advise the use of, these new tests to strengthen conclusions in suspicious data sets.
作者
我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。
推荐
暂无数据