4.7 Article

A draft fur seal genome provides insights into factors affecting SNP validation and how to mitigate them

期刊

MOLECULAR ECOLOGY RESOURCES
卷 16, 期 4, 页码 909-921

出版社

WILEY
DOI: 10.1111/1755-0998.12502

关键词

Antarctic fur seal; Arctocephalus gazella; cross-validation; draft genome; single nucleotide polymorphism; SNP array

资金

  1. Marie Curie FP7-Reintegration-Grant within the 7th European Community Framework Programme [PCIG-GA-2011-303618]
  2. Natural Environment Research Council
  3. Knut and Alice Wallenberg Foundation
  4. Swedish Research Council FORMAS [231-2012-450]
  5. Deutsch Forchungsgemeinschaft studentship
  6. Natural Environment Research Council [bas0100035] Funding Source: researchfish
  7. NERC [bas0100035] Funding Source: UKRI

向作者/读者索取更多资源

Custom genotyping arrays provide a flexible and accurate means of genotyping single nucleotide polymorphisms (SNPs) in a large number of individuals of essentially any organism. However, validation rates, defined as the proportion of putative SNPs that are verified to be polymorphic in a population, are often very low. A number of potential causes of assay failure have been identified, but none have been explored systematically. In particular, as SNPs are often developed from transcriptomes, parameters relating to the genomic context are rarely taken into account. Here, we assembled a draft Antarctic fur seal (Arctocephalus gazella) genome (assembly size: 2.41 Gb; scaffold/contig N-50: 3.1 Mb/27.5 kb). We then used this resource to map the probe sequences of 144 putative SNPs genotyped in 480 individuals. The number of probe-to-genome mappings and alignment length together explained almost a third of the variation in validation success, indicating that sequence uniqueness and proximity to intron-exon boundaries play an important role. The same pattern was found after mapping the probe sequences to the Walrus and Weddell seal genomes, suggesting that the genomes of species divergent by as much as 23 million years can hold information relevant to SNP validation outcomes. Additionally, reanalysis of genotyping data from seven previous studies found the same two variables to be significantly associated with SNP validation success across a variety of taxa. Finally, our study reveals considerable scope for validation rates to be improved, either by simply filtering for SNPs whose flanking sequences align uniquely and completely to a reference genome, or through predictive modelling.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据