4.7 Article

Low-coverage sequencing cost-effectively detects known and novel variation in underrepresented populations

期刊

AMERICAN JOURNAL OF HUMAN GENETICS
卷 108, 期 4, 页码 656-668

出版社

CELL PRESS
DOI: 10.1016/j.ajhg.2021.03.012

关键词

-

资金

  1. Stanley Center for Psychiatric Research at the Broad Institute
  2. National Institutes of Health [K99/R00MH117229, K01MH121659, T32MH017119]
  3. Medical Research Council
  4. Department for International Development through the Africa Research Leader scheme
  5. [R01MH120642]

向作者/读者索取更多资源

Genetic studies in underrepresented populations often find novel associations, and sequencing whole genomes at low coverage (>= 4x) can capture variants more accurately and at a similar cost than commonly used genotyping arrays. Low-coverage sequencing approaches are effective in identifying novel variation in underrepresented populations and offer opportunities for enhancing variant discovery.
Genetic studies in underrepresented populations identify disproportionate numbers of novel associations. However, most genetic studies use genotyping arrays and sequenced reference panels that best capture variation most common in European ancestry populations. To compare data generation strategies best suited for underrepresented populations, we sequenced the whole genomes of 91 individuals to high coverage as part of the Neuropsychiatric Genetics of African Population-Psychosis (NeuroGAP-Psychosis) study with participants from Ethiopia, Kenya, South Africa, and Uganda. We used a downsampling approach to evaluate the quality of two cost-effective data generation strategies, GWAS arrays versus low-coverage sequencing, by calculating the concordance of imputed variants from these technologies with those from deep whole-genome sequencing data. We show that low-coverage sequencing at a depth of >= 4x captures variants of all frequencies more accurately than all commonly used GWAS arrays investigated and at a comparable cost. Lower depths of sequencing (0.5-1x) performed comparably to commonly used low-density GWAS arrays. Low-coverage sequencing is also sensitive to novel variation; 43 sequencing detects 45% of singletons and 95% of common variants identified in high-coverage African whole genomes. Low-coverage sequencing approaches surmount the problems induced by the ascertainment of common genotyping arrays, effectively identify novel variation particularly in underrepresented populations, and present opportunities to enhance variant discovery at a cost similar to traditional approaches.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据