4.6 Article

MetaGT: A pipeline for de novo assembly of metatranscriptomes with the aid of metagenomic data

期刊

FRONTIERS IN MICROBIOLOGY
卷 13, 期 -, 页码 -

出版社

FRONTIERS MEDIA SA
DOI: 10.3389/fmicb.2022.981458

关键词

metatranscriptomics; metagenomics; RNA-Seq; de novo assembly; computational pipeline

资金

  1. Russian Scientific Foundation
  2. [19-14-00172]

向作者/读者索取更多资源

While metagenome sequencing provides insights on microbial communities, metatranscriptome analysis is useful for studying functional activity. However, the complexity, dynamic range, and lack of computational methods for assembling metatranscriptomes pose challenges. This study presents MetaGT, a pipeline that combines metatranscriptomic and metagenomic data to improve assembly coverage and completeness. Results show significant improvement compared to existing methods. The pipeline, implemented in NextFlow, is available athttps://github.com/ablab/metaGT.
While metagenome sequencing may provide insights on the genome sequences and composition of microbial communities, metatranscriptome analysis can be useful for studying the functional activity of a microbiome. RNA-Seq data provides the possibility to determine active genes in the community and how their expression levels depend on external conditions. Although the field of metatranscriptomics is relatively young, the number of projects related to metatranscriptome analysis increases every year and the scope of its applications expands. However, there are several problems that complicate metatranscriptome analysis: complexity of microbial communities, wide dynamic range of transcriptome expression and importantly, the lack of high-quality computational methods for assembling meta-RNA sequencing data. These factors deteriorate the contiguity and completeness of metatranscriptome assemblies, therefore affecting further downstream analysis. Here we present MetaGT, a pipeline for de novo assembly of metatranscriptomes, which is based on the idea of combining both metatranscriptomic and metagenomic data sequenced from the same sample. MetaGT assembles metatranscriptomic contigs and fills in missing regions based on their alignments to metagenome assembly. This approach allows to overcome described complexities and obtain complete RNA sequences, and additionally estimate their abundances. Using various publicly available real and simulated datasets, we demonstrate that MetaGT yields significant improvement in coverage and completeness of metatranscriptome assemblies compared to existing methods that do not exploit metagenomic data. The pipeline is implemented in NextFlow and is freely available fromhttps://github.com/ablab/metaGT.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.6
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据