4.6 Article

GenErode: a bioinformatics pipeline to investigate genome erosion in endangered and extinct species

期刊

BMC BIOINFORMATICS
卷 23, 期 1, 页码 -

出版社

BMC
DOI: 10.1186/s12859-022-04757-0

关键词

Bioinformatics; Temporal genomic erosion; Conservation genomics; Whole genome re-sequencing data; Endangered species; Reproducibility; Snakemake; Ancient DNA

资金

  1. Stockholm University
  2. Carl Tryggers Foundation [CTS 17:109, CTS 19:257]
  3. European Union [796877]
  4. Swedish Research Council [2017-04647, 2018-05973]
  5. FORMAS [2015-676]
  6. Swiss National Science Foundation [P2SKP3_165031, P300PA_177845]
  7. Knut and Alice Wallenberg Foundation as part of the National Bioinformatics Infrastructure Sweden at SciLifeLab
  8. Swiss National Science Foundation (SNF) [P300PA_177845, P2SKP3_165031] Funding Source: Swiss National Science Foundation (SNF)
  9. Swedish Research Council [2017-04647] Funding Source: Swedish Research Council
  10. Marie Curie Actions (MSCA) [796877] Funding Source: Marie Curie Actions (MSCA)

向作者/读者索取更多资源

The study presents a flexible, scalable, and modular pipeline for comparing patterns of genomic erosion in samples from different time periods. The pipeline utilizes state-of-the-art bioinformatics tools to process whole-genome re-sequencing data from ancient/historical and modern samples, and produces comparable estimates of genomic erosion indices.
Background: Many wild species have suffered drastic population size declines over the past centuries, which have led to 'genomic erosion' processes characterized by reduced genetic diversity, increased inbreeding, and accumulation of harmful mutations. Yet, genomic erosion estimates of modern-day populations often lack concordance with dwindling population sizes and conservation status of threatened species. One way to directly quantify the genomic consequences of population declines is to compare genome-wide data from pre-decline museum samples and modern samples. However, doing so requires computational data processing and analysis tools specifically adapted to comparative analyses of degraded, ancient or historical, DNA data with modern DNA data as well as personnel trained to perform such analyses. Results: Here, we present a highly flexible, scalable, and modular pipeline to compare patterns of genomic erosion using samples from disparate time periods. The GenErode pipeline uses state-of-the-art bioinformatics tools to simultaneously process whole-genome re-sequencing data from ancient/historical and modern samples, and to produce comparable estimates of several genomic erosion indices. No programming knowledge is required to run the pipeline and all bioinformatic steps are well-documented, making the pipeline accessible to users with different backgrounds. GenErode is written in Snakemake and Python3 and uses Conda and Singularity containers to achieve reproducibility on high-performance compute clusters. The source code is freely available on GitHub (https://github.com/NBISweden/GenErode). Conclusions: GenErode is a user-friendly and reproducible pipeline that enables the standardization of genomic erosion indices from temporally sampled whole genome re-sequencing data.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.6
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据