4.5 Article

Binning Metagenomic Contigs Using Unsupervised Clustering and Reference Databases

出版社

SPRINGER HEIDELBERG
DOI: 10.1007/s12539-022-00526-y

关键词

Metagenomics; Unsupervised clustering; Reference databases; Binning

资金

  1. National Natural Science Foundation of China [61373057]

向作者/读者索取更多资源

This study proposes a new method for metagenomic contig binning, which utilizes unsupervised clustering methods and reference databases to address the challenges faced by current methods. Experimental results demonstrate that this method can improve the binning effect.
Metagenomics can directly extract the genetic material of all microorganisms from the environment, and obtain metagenomic samples with a large number of unknown DNA sequences. Binning of metagenomic contigs is a hot topic in metagenomics research. There are two key challenges for the current unsupervised metagenomic clustering algorithms. First, unsupervised metagenomic clustering methods rarely use reference databases, causing a certain waste of resources. Second, unsupervised metagenomic clustering methods are restricted by the characteristics of the sequences and the clustering algorithms, and the binning effect is limited. Therefore, a new binning method for metagenomic contigs using unsupervised clustering methods and reference databases is proposed to address these challenges, to make full use of the advantages of unsupervised clustering methods and reference databases constructed by scientists to improve the overall binning effect. This method uses the integrated SVM classification model to further bin the unsupervised clustering parts that do not perform well. Our proposed method was tested on simulated datasets and a real dataset and compared with other state-of-the-art metagenomic clustering methods including CONCOCT, Metabin2.0, Autometa, and MetaBAT. The results show that our method can achieve higher precision rate and improve the binning effect. [GRAPHICS] .

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.5
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据