4.6 Article

Comprehensive Characterization of Human Genome Variation by High Coverage Whole-Genome Sequencing of Forty Four Caucasians

Journal

PLOS ONE
Volume 8, Issue 4, Pages -

Publisher

PUBLIC LIBRARY SCIENCE
DOI: 10.1371/journal.pone.0059494

Keywords

-

Funding

  1. National Institutes of Health [P50AR055081, R01AG026564, R01AR050496, R01AR057049, R03TW008221]
  2. Franklin D. Dickson/Missouri Endowment from University of Missouri-Kansas City
  3. Edward G. Schlieder Endowment from Tulane University
  4. Shanghai Leading Academic Discipline Project [S30501]

Whole genome sequencing studies are essential to obtain a comprehensive understanding of the vast pattern of human genomic variation. Here we report the results of a high-coverage whole genome sequencing study of 44 unrelated healthy Caucasian adults, each sequenced to over 50-fold coverage (averaging 65.8x). We identified approximately 11 million single nucleotide polymorphisms (SNPs), 2.8 million short insertions and deletions (indels), and over 500,000 block substitutions. We showed that, although previous studies, including the 1000 Genomes Project Phase 1 study, have catalogued the vast majority of common SNPs, many low-frequency and rare variants remain undiscovered. For instance, approximately 1.4 million SNPs and 1.3 million short indels that we found were novel to both the dbSNP and the 1000 Genomes Project Phase 1 data sets, and the majority of these (~96%) have a minor allele frequency of less than 5%. On average, each individual genome carried ~3.3 million SNPs and ~492,000 indels/block substitutions, including approximately 179 variants that were predicted to cause loss of function of the gene products. Moreover, each individual genome carried an average of 44 such loss-of-function variants in a homozygous state, which would completely knock out the corresponding genes. Across all 44 genomes, a total of 182 genes were knocked out in at least one individual genome, among which 46 genes were knocked out in over 30% of our samples, suggesting that a number of genes are commonly knocked out in general populations. Gene ontology analysis suggested that these commonly knocked-out genes are enriched in biological processes related to antigen processing and immune response. Our results contribute towards a comprehensive characterization of human genomic variation, especially for less-common and rare variants, and provide an invaluable resource for future genetic studies of human variation and diseases.
