☆ 4.7 Article

snpfiltr: An R package for interactive and reproducible SNP filtering

MOLECULAR ECOLOGY RESOURCES (2022)

期刊

MOLECULAR ECOLOGY RESOURCES

卷 22, 期 6, 页码 2443-2453

出版社

WILEY

DOI: 10.1111/1755-0998.13618

关键词

bioinfomatics; phyloinfomatics; genomics; missing data; R package; reproducibility; SNP filtering

类别

Biochemistry & Molecular Biology Ecology Evolutionary Biology

向作者/读者索取更多资源

Protocol

社区支持

Reagent

社区支持

智能总结 New
摘要

The SNPfiltR package in R provides functionalities for customizable and reproducible SNP filtering pipeline. It extends existing SNP filtering functionalities by automating the visualization of key parameters.

Here, I describe the R package snpfiltr and demonstrate its functionality as the backbone of a customizable, reproducible single nucleotide polymorphism (SNP) filtering pipeline implemented exclusively via the widely adopted R programming language. SNPfiltR extends existing SNP filtering functionalities by automating the visualization of key parameters such as sequencing depth, quality, and missing data proportion, allowing users to visually optimize and implement filtering thresholds within a single, cohesive work session. All SNPfiltR functions require vcfr objects as input, which can be easily generated by reading a SNP data set stored in standard variant call format (vcf) into an R working environment using the function read.vcfR() from the R package vcfr. Performance and accuracy benchmarking reveal that for moderately sized SNP data sets (up to 50 M genotypes, plus associated quality information), SNPfiltR performs filtering with comparable accuracy and efficiency to current state of the art command-line-based programs. These results indicate that for most reduced-representation genomic data sets, SNPfiltR is an ideal choice for investigating, visualizing, and filtering SNPs as part of a user friendly bioinformatic pipeline. The snpfiltr package can be downloaded from CRAN with the command install.packages(snpfiltr), and the current development version is available from GitHub at: (). Thorough documentation for SNPfiltR, including multiple comprehensive vignettes detailing realistic use-cases, is available at the website: .

snpfiltr: An R package for interactive and reproducible SNP filtering

期刊

MOLECULAR ECOLOGY RESOURCES

出版社

WILEY

关键词

类别

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

snpfiltr: An R package for interactive and reproducible SNP filtering

期刊

MOLECULAR ECOLOGY RESOURCES

出版社

WILEY

关键词

类别

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

导出引文

分享论文