Journal
ISME JOURNAL
Volume 6, Issue 4, Pages 898-901
Publisher
NATURE PUBLISHING GROUP
DOI: 10.1038/ismej.2011.147
Keywords
metagenome; assembly; Illumina
Funding
- U.S. Department of Energy (DOE) [DE-SC0004601, DE-AC02-0SCH11231]
Assembling individual genomes from complex community metagenomic data remains a challenge for environmental studies. We evaluated the quality of genome assemblies from community short-read data (Illumina 100 bp paired-end sequences) using datasets recovered from freshwater and soil microbial communities as well as in silico simulations. Our analyses revealed that the genome of a single genotype (or species) can be accurately assembled from a complex metagenome when it shows at least about 20x coverage. At lower coverage, however, the derived assemblies contained a substantial fraction of non-target sequences (chimeras), which explains, at least in part, the higher number of hypothetical genes recovered in metagenomic relative to genomic projects. We also provide examples of how to detect intrapopulation structure in metagenomic datasets and how to estimate the type and frequency of errors in assembled genes and contigs from datasets of varied species complexity.
The ISME Journal (2012) 6, 898-901; doi: 10.1038/ismej.2011.147; published online 27 October 2011
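To make the roughly 20x coverage threshold concrete, the sketch below uses the standard mean-coverage estimate C = N * L / G (reads times read length over genome size). It is not taken from the paper: the function names, the 5 Mbp genome size, and the read counts are illustrative assumptions, and N is assumed to count only the reads that originate from the target population, not the whole metagenome.

```python
import math

# Approximate threshold reported in the abstract ("at least about 20x").
MIN_COVERAGE = 20.0


def mean_coverage(num_reads: int, read_length: int, genome_size: int) -> float:
    """Expected mean coverage C = N * L / G for one target genome.

    num_reads is assumed to be the number of reads derived from the
    target population only (hypothetical input, not from the paper).
    """
    if genome_size <= 0:
        raise ValueError("genome_size must be positive")
    return num_reads * read_length / genome_size


def reads_needed(target_coverage: float, read_length: int, genome_size: int) -> int:
    """Smallest number of reads that reaches target_coverage."""
    return math.ceil(target_coverage * genome_size / read_length)


def meets_threshold(num_reads: int, read_length: int, genome_size: int,
                    min_coverage: float = MIN_COVERAGE) -> bool:
    """True if the expected coverage is at or above the chosen threshold."""
    return mean_coverage(num_reads, read_length, genome_size) >= min_coverage


if __name__ == "__main__":
    # Illustrative numbers: a 5 Mbp genome sequenced with 100 bp reads.
    genome_size = 5_000_000
    read_length = 100
    print(mean_coverage(1_000_000, read_length, genome_size))
    print(reads_needed(MIN_COVERAGE, read_length, genome_size))
    print(meets_threshold(500_000, read_length, genome_size))
```

Under these assumed numbers, one million 100 bp reads from the target population give an expected coverage at the threshold, while half as many fall in the range where the abstract reports a substantial fraction of chimeric sequence.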