4.6 Article

Metagenomics workflow for hybrid assembly, differential coverage binning, metatranscriptomics and pathway analysis (MUFFIN)

Journal

PLOS COMPUTATIONAL BIOLOGY
Volume 17, Issue 2, Pages -

Publisher

PUBLIC LIBRARY SCIENCE
DOI: 10.1371/journal.pcbi.1008716

Keywords

-

Funding

  1. Deutsche Forschungsgemeinschaft (DFG, German Research Foundation) [BR 5692/1-1, BR 5692/1-2]
  2. FORMAS [942-2015-1008]
  3. Collaborative Research Centre AquaDiva of the Friedrich Schiller University Jena [CRC 1076 AquaDiva]
  4. DFG
  5. Joachim Herz Foundation by the add-on fellowship for interdisciplinary life science

Ask authors/readers for more resources

Metagenomics has revolutionized microbiology, but analyzing metagenome-assembled genomes remains challenging. MUFFIN, a bioinformatics workflow, integrates short and long reads to reconstruct bacterial species and their functions with high reproducibility and efficiency. MUFFIN is a user-friendly tool that automates complex analysis steps and provides valuable insights into microbial communities.
Metagenomics has redefined many areas of microbiology. However, metagenome-assembled genomes (MAGs) are often fragmented, primarily when sequencing was performed with short reads. Recent long-read sequencing technologies promise to improve genome reconstruction. However, the integration of two different sequencing modalities makes downstream analyses complex. We, therefore, developed MUFFIN, a complete metagenomic workflow that uses short and long reads to produce high-quality bins and their annotations. The workflow is written by using Nextflow, a workflow orchestration software, to achieve high reproducibility and fast and straightforward use. This workflow also produces the taxonomic classification and KEGG pathways of the bins and can be further used for quantification and annotation by providing RNA-Seq data (optionally). We tested the workflow using twenty biogas reactor samples and assessed the capacity of MUFFIN to process and output relevant files needed to analyze the microbial community and their function. MUFFIN produces functional pathway predictions and, if provided de novo metatranscript annotations across the metagenomic sample and for each bin. MUFFIN is available on github under GNUv3 licence: . Author summary Determining the entire DNA of environmental samples (sequencing) is a fundamental approach to gain deep insights into complex bacterial communities and their functions. However, this approach produces enormous amounts of data, which makes analysis time intense and complicated. We developed the Software MUFFIN, which effortlessly untangle the complex sequencing data to reconstruct individual bacterial species and determine their functions. Our software is performing multiple complicated steps in parallel, automatically allowing everyone with only basic informatics skills to analyze complex microbial communities. For this, we combine two sequencing technologies: long-sequences (nanopore, better reconstruction) and short-sequences (Illumina, higher accuracy). After the reconstruction, we group the fragments that belong together (binning) via multiple approaches and refinement steps while also utilizing the information from other bacterial communities (differential binning). This process creates hundreds of bins whereas each represents a different bacterial species with a unique function. We automatically determine their species, assess each genome's completeness, and attribute their biological functions and activity (transcriptomics and pathways). Our Software is entirely freely available to everyone and runs on a good computer, compute cluster, or via cloud.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.6
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available