4.5 Article

Binning Metagenomic Contigs Using Unsupervised Clustering and Reference Databases

Journal

Publisher

SPRINGER HEIDELBERG
DOI: 10.1007/s12539-022-00526-y

Keywords

Metagenomics; Unsupervised clustering; Reference databases; Binning

Funding

  1. National Natural Science Foundation of China [61373057]

Ask authors/readers for more resources

This study proposes a new method for metagenomic contig binning, which utilizes unsupervised clustering methods and reference databases to address the challenges faced by current methods. Experimental results demonstrate that this method can improve the binning effect.
Metagenomics can directly extract the genetic material of all microorganisms from the environment, and obtain metagenomic samples with a large number of unknown DNA sequences. Binning of metagenomic contigs is a hot topic in metagenomics research. There are two key challenges for the current unsupervised metagenomic clustering algorithms. First, unsupervised metagenomic clustering methods rarely use reference databases, causing a certain waste of resources. Second, unsupervised metagenomic clustering methods are restricted by the characteristics of the sequences and the clustering algorithms, and the binning effect is limited. Therefore, a new binning method for metagenomic contigs using unsupervised clustering methods and reference databases is proposed to address these challenges, to make full use of the advantages of unsupervised clustering methods and reference databases constructed by scientists to improve the overall binning effect. This method uses the integrated SVM classification model to further bin the unsupervised clustering parts that do not perform well. Our proposed method was tested on simulated datasets and a real dataset and compared with other state-of-the-art metagenomic clustering methods including CONCOCT, Metabin2.0, Autometa, and MetaBAT. The results show that our method can achieve higher precision rate and improve the binning effect. [GRAPHICS] .

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.5
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available