☆ 4.4 Article

Eu-Detect: An algorithm for detecting eukaryotic sequences in metagenomic data sets

JOURNAL OF BIOSCIENCES (2011)

期刊

JOURNAL OF BIOSCIENCES

卷 36, 期 4, 页码 709-717

出版社

INDIAN ACAD SCIENCES

DOI: 10.1007/s12038-011-9105-2

关键词

Alignment-free; feature vector space; metagenomics; micro-eukaryotes; oligonucleotide composition

类别

Biology

向作者/读者索取更多资源

Protocol

社区支持

Reagent

社区支持

摘要

Physical partitioning techniques are routinely employed (during sample preparation stage) for segregating the prokaryotic and eukaryotic fractions of metagenomic samples. In spite of these efforts, several metagenomic studies focusing on bacterial and archaeal populations have reported the presence of contaminating eukaryotic sequences in metagenomic data sets. Contaminating sequences originate not only from genomes of micro-eukaryotic species but also from genomes of (higher) eukaryotic host cells. The latter scenario usually occurs in the case of host-associated metagenomes. Identification and removal of contaminating sequences is important, since these sequences not only impact estimates of microbial diversity but also affect the accuracy of several downstream analyses. Currently, the computational techniques used for identifying contaminating eukaryotic sequences, being alignment based, are slow, inefficient, and require huge computing resources. In this article, we present Eu-Detect, an alignment-free algorithm that can rapidly identify eukaryotic sequences contaminating metagenomic data sets. Validation results indicate that on a desktop with modest hardware specifications, the Eu-Detect algorithm is able to rapidly segregate DNA sequence fragments of prokaryotic and eukaryotic origin, with high sensitivity. A Web server for the Eu-Detect algorithm is available at http://metagenomics.atc.tcs.com/Eu-Detect/.

Eu-Detect: An algorithm for detecting eukaryotic sequences in metagenomic data sets

期刊

JOURNAL OF BIOSCIENCES

出版社

INDIAN ACAD SCIENCES

关键词

类别

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

Eu-Detect: An algorithm for detecting eukaryotic sequences in metagenomic data sets

期刊

JOURNAL OF BIOSCIENCES

出版社

INDIAN ACAD SCIENCES

关键词

类别

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

导出引文

分享论文