4.5 Article

Dashing: fast and accurate genomic distances with HyperLogLog

期刊

GENOME BIOLOGY
卷 20, 期 1, 页码 -

出版社

BMC
DOI: 10.1186/s13059-019-1875-0

关键词

Sketch data structures; Hyperloglog; Metagenomics; Alignment; Sequencing; Genomic distance

资金

  1. National Science Foundation [IIS-1349906]
  2. National Institutes of Health/National Institute of General Medical Sciences [R01GM118568]

向作者/读者索取更多资源

Dashing is a fast and accurate software tool for estimating similarities of genomes or sequencing datasets. It uses the HyperLogLog sketch together with cardinality estimation methods that are specialized for set unions and intersections. Dashing summarizes genomes more rapidly than previous MinHash-based methods while providing greater accuracy across a wide range of input sizes and sketch sizes. It can sketch and calculate pairwise distances for over 87K genomes in 6 minutes. Dashing is open source and available at https://github.com/dnbaker/dashing.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.5
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据