4.4 Article

Community-wide analysis of microbial genome sequence signatures

期刊

GENOME BIOLOGY
卷 10, 期 8, 页码 -

出版社

BMC
DOI: 10.1186/gb-2009-10-8-r85

关键词

-

资金

  1. DOE Genomics (Office of Science) [DE-FG02-05ER64134]
  2. Swedish Research Council
  3. Carl Tryggers Foundation

向作者/读者索取更多资源

Background: Analyses of DNA sequences from cultivated microorganisms have revealed genome-wide, taxa-specific nucleotide compositional characteristics, referred to as genome signatures. These signatures have far-reaching implications for understanding genome evolution and potential application in classification of metagenomic sequence fragments. However, little is known regarding the distribution of genome signatures in natural microbial communities or the extent to which environmental factors shape them. Results: We analyzed metagenomic sequence data from two acidophilic biofilm communities, including composite genomes reconstructed for nine archaea, three bacteria, and numerous associated viruses, as well as thousands of unassigned fragments from strain variants and low-abundance organisms. Genome signatures, in the form of tetranucleotide frequencies analyzed by emergent self-organizing maps, segregated sequences from all known populations sharing <50-60% average amino acid identity and revealed previously unknown genomic clusters corresponding to low-abundance organisms and a putative plasmid. Signatures were pervasive genome wide. Clusters were resolved because intra-genome differences resulting from translational selection or protein adaptation to the intracellular (pH similar to 5) versus extracellular (pH similar to 1) environment were small relative to inter-genome differences. We found that these genome signatures stem from multiple influences but are primarily manifested through codon composition, which we propose is the result of genome-specific mutational biases. Conclusions: An important conclusion is that shared environmental pressures and interactions among coevolving organisms do not obscure genome signatures in acid mine drainage communities. Thus, genome signatures can be used to assign sequence fragments to populations, between pairs of genomes correlates with the difference predicted based on codon composition. Additional data file 7 shows tetra-ESOM of deeply-sampled genomes for which coding and noncoding regions were separated. Additional data file 8 shows for incorrectly binned fragments the percentage of sequence coding for genes in comparison with the genome-wide coding percentage. Additional data file 9 shows tetra-ESOM of Leptospirillum group II genes coding for highly expressed proteins or proteins enriched in the extracellular fraction analyzed as separate fractions from the rest of the genome. Additional data file 10 is a schematic of processes and factors influencing genome signature.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.4
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据