4.7 Article

Recovery of 197 eukaryotic bins reveals major challenges for eukaryote genome reconstruction from terrestrial metagenomes

期刊

MOLECULAR ECOLOGY RESOURCES
卷 23, 期 5, 页码 1066-1076

出版社

WILEY
DOI: 10.1111/1755-0998.13776

关键词

eukaryotes; genome-resolved metagenomics; Hypocreales; Mamiellales; Saccharomycetales

向作者/读者索取更多资源

Most eukaryotic genomes remain unsequenced, limiting our understanding of their role in ecosystem processes. This study aimed to recover eukaryotic genomes from metagenomes using the EukRep pipeline. Out of 6000 metagenomes, only 215 yielded eukaryotic bins, with Streptophytes and fungi being the most represented clades. The completeness and contamination estimates of the recovered bins were obtained and highlighted the need for long-read sequencing and improved reference genomes databases.
As most eukaryotic genomes are yet to be sequenced, the mechanisms underlying their contribution to different ecosystem processes remain untapped. Although approaches to recovering Prokaryotic genomes have become common in genome biology, few studies have tackled the recovery of eukaryotic genomes from metagenomes. This study assessed the reconstruction of microbial eukaryotic genomes using 6000 metagenomes from terrestrial and some transition environments using the EukRep pipeline. Only 215 metagenomic libraries yielded eukaryotic bins. From a total of 447 eukaryotic bins recovered 197 were classified at the phylum level. Streptophytes and fungi were the most represented clades with 83 and 73 bins, respectively. More than 78% of the obtained eukaryotic bins were recovered from samples whose biomes were classified as host-associated, aquatic, and anthropogenic terrestrial. However, only 93 bins were taxonomically assigned at the genus level and 17 bins at the species level. Completeness and contamination estimates were obtained for a total of 193 bins and consisted of 44.64% (sigma = 27.41%) and 3.97% (sigma = 6.53%), respectively. Micromonas commoda was the most frequent taxon found while Saccharomyces cerevisiae presented the highest completeness, probably because more reference genomes are available. Current measures of completeness are based on the presence of single-copy genes. However, mapping of the contigs from the recovered eukaryotic bins to the chromosomes of the reference genomes showed many gaps, suggesting that completeness measures should also include chromosome coverage. Recovering eukaryotic genomes will benefit significantly from long-read sequencing, development of tools for dealing with repeat-rich genomes, and improved reference genomes databases.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据