4.7 Article

Joint identification of sex and sex-linked scaffolds in non-model organisms using low depth sequencing data

期刊

MOLECULAR ECOLOGY RESOURCES
卷 22, 期 2, 页码 458-467

出版社

WILEY
DOI: 10.1111/1755-0998.13491

关键词

autosomes; bioinformatics; resequencing; scaffold-level assembly

资金

  1. Det Frie Forskningsrad [DFF8049-00098B]
  2. European Research Council [853442]
  3. Carlsbergfondet [CF19-0427]
  4. Novo Nordisk Fonden [NNF20OC0061343]
  5. Lundbeckfonden [R215-2015-4174]
  6. European Research Council (ERC) [853442] Funding Source: European Research Council (ERC)

向作者/读者索取更多资源

The SATC method is proposed to assign sex to samples and identify sex-linked scaffolds from NGS data, working for species with a homogametic/heterogametic sex determination system. It uses scaffold depth distribution and PCA to achieve sex assignment and sex-linked scaffold identification, without prior knowledge of sample sex, using WGS data.
Being able to assign sex to individuals and identify autosomal and sex-linked scaffolds are essential in most population genomic analyses. Non-model organisms often have genome assemblies at scaffold-level and lack characterization of sex-linked scaffolds. Previous methods to identify sex and sex-linked scaffolds have relied on synteny between the non-model organism and a closely related species or prior knowledge about the sex of the samples to identify sex-linked scaffolds. In the latter case, the difference in depth of coverage between the autosomes and the sex chromosomes are used. Here, we present sex assignment through coverage (SATC), a method to assign sex to samples and identify sex-linked scaffolds from next generation sequencing (NGS) data. The method works for species with a homogametic/heterogametic sex determination system and only requires a scaffold-level reference assembly and sampling of both sexes with whole genome sequencing (WGS) data. We use the sequencing depth distribution across scaffolds to jointly identify: (i) male and female individuals, and (ii) sex-linked scaffolds. This is achieved through projecting the scaffold depths into a low-dimensional space using principal component analysis (PCA) and subsequent Gaussian mixture clustering. We demonstrate the applicability of our method using data from five mammal species and a bird species complex. The method is freely available at as R code and a graphical user interface (GUI).

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据