4.5 Review

Scalable Data Analysis in Proteomics and Metabolomics Using BioContainers and Workflows Engines

期刊

PROTEOMICS
卷 20, 期 9, 页码 -

出版社

WILEY
DOI: 10.1002/pmic.201900147

关键词

bioconda; biocontainers; bioinformatics; containers; large scale data analysis; workflows

资金

  1. BBSRC [BB/L024225/1]
  2. Wellcome Trust [208391/Z/17/Z]
  3. European Commission [676559]
  4. BBSRC [BB/L024225/1] Funding Source: UKRI

向作者/读者索取更多资源

The recent improvements in mass spectrometry instruments and new analytical methods are increasing the intersection between proteomics and big data science. In addition, bioinformatics analysis is becoming increasingly complex and convoluted, involving multiple algorithms and tools. A wide variety of methods and software tools have been developed for computational proteomics and metabolomics during recent years, and this trend is likely to continue. However, most of the computational proteomics and metabolomics tools are designed as single-tiered software application where the analytics tasks cannot be distributed, limiting the scalability and reproducibility of the data analysis. In this paper the key steps of metabolomics and proteomics data processing, including the main tools and software used to perform the data analysis, are summarized. The combination of software containers with workflows environments for large-scale metabolomics and proteomics analysis is discussed. Finally, a new approach for reproducible and large-scale data analysis based on BioContainers and two of the most popular workflow environments, Galaxy and Nextflow, is introduced to the proteomics and metabolomics communities.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.5
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据