4.8 Review

Legacy Data Confound Genomics Studies

Journal

MOLECULAR BIOLOGY AND EVOLUTION
Volume 37, Issue 1, Pages 2-10

Publisher

OXFORD UNIV PRESS
DOI: 10.1093/molbev/msz201

Keywords

batch effect; mutational signature; statistical genetics; population genetics; reference cohorts; imputation

Recent reports have identified differences in the mutational spectra across human populations. Although some of these reports have been replicated in other cohorts, most have been reported only in the 1000 Genomes Project (1kGP) data. While investigating an intriguing putative population stratification within the Japanese population, we identified a previously unreported batch effect leading to spurious mutation calls in the 1kGP data and to the apparent population stratification. Because the 1kGP data are used extensively, we find that the batch effects also lead to incorrect imputation by leading imputation servers and a small number of suspicious GWAS associations. Lower quality data from the early phases of the 1kGP thus continue to contaminate modern studies in hidden ways. It may be time to retire or upgrade such legacy sequencing data.
