4.7 Article

Limits and convergence properties of the sequentially Markovian coalescent

期刊

MOLECULAR ECOLOGY RESOURCES
卷 21, 期 7, 页码 2231-2248

出版社

WILEY
DOI: 10.1111/1755-0998.13416

关键词

ancestral recombination graph; kingman coalescent; population genetics

资金

  1. Deutsche Forschungsgemeinschaft [317616126 (TE809/7-1)]
  2. Alexander von Humboldt Stiftung

向作者/读者索取更多资源

Various methods based on SMC have been developed to infer population demographic history using full genome sequence data, but a comprehensive investigation into the limitations of these methods is lacking. Researchers have introduced a tool to infer the best case convergence of SMC methods and evaluated the inference accuracy when assumptions are violated. Additionally, they emphasize the importance of the transition matrix in SMC methods.
Several methods based on the sequentially Markovian coalescent (SMC) make use of full genome sequence data from samples to infer population demographic history including past changes in population size, admixture, migration events and population structure. More recently, the original theoretical framework has been extended to allow the simultaneous estimation of population size changes along with other life history traits such as selfing or seed banking. The latter developments enhance the applicability of SMC methods to nonmodel species. Although convergence proofs have been given using simulated data in a few specific cases, an in-depth investigation of the limitations of SMC methods is lacking. In order to explore such limits, we first develop a tool inferring the best case convergence of SMC methods assuming the true underlying coalescent genealogies are known. This tool can be used to quantify the amount and type of information that can be confidently retrieved from given data sets prior to the analysis of the real data. Second, we assess the inference accuracy when the assumptions of SMC approaches are violated due to departures from the model, namely the presence of transposable elements, variable recombination and mutation rates along the sequence, and SNP calling errors. Third, we deliver a new interpretation of SMC methods by highlighting the importance of the transition matrix, which we argue can be used as a set of summary statistics in other statistical inference methods, uncoupling the SMC from hidden Markov models (HMMs). We finally offer recommendations to better apply SMC methods and build adequate data sets under budget constraints.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据