4.7 Article

KMCP: accurate metagenomic profiling of both prokaryotic and viral populations by pseudo-mapping

向作者/读者索取更多资源

This study developed a K-mer-based metagenomic profiling tool, KMCP, which utilizes genome coverage information and a modified Compact Bit-Sliced Signature Index for fast alignment-free sequence searching. Experimental results demonstrate that KMCP allows accurate taxonomic profiling of prokaryotic and viral populations and provides more reliable pathogen detection in low-depth clinical samples.
Motivation: The growing number of microbial reference genomes enables the improvement of metagenomic profiling accuracy but also imposes greater requirements on the indexing efficiency, database size and runtime of taxonomic profilers. Additionally, most profilers focus mainly on bacterial, archaeal and fungal populations, while less attention is paid to viral communities. Results: We present KMCP (K-mer-based Metagenomic Classification and Profiling), a novel k-mer-based metagenomic profiling tool that utilizes genome coverage information by splitting the reference genomes into chunks and stores k-mers in a modified and optimized Compact Bit-Sliced Signature Index for fast alignment-free sequence searching. KMCP combines k-mer similarity and genome coverage information to reduce the false positive rate of k-mer-based taxonomic classification and profiling methods. Benchmarking results based on simulated and real data demonstrate that KMCP, despite a longer running time than all other methods, not only allows the accurate taxonomic profiling of prokaryotic and viral populations but also provides more confident pathogen detection in clinical samples of low depth. Availability and implementation: The software is open-source under the MIT license and available at https://github.com/shenwei356/kmcp. Contact: shenwei356@cqmu.edu.cn or hp_cq@163.com or hupengcq@hospital.cqmu.edu.cn or renhong0531@cqmu.edu.cn Supplementary information: Supplementary data are available at Bioinformatics online.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据