4.7 Article

MapReduce implementation of a hybrid spectral library-database search method for large-scale peptide identification

期刊

BIOINFORMATICS
卷 27, 期 21, 页码 3072-3073

出版社

OXFORD UNIV PRESS
DOI: 10.1093/bioinformatics/btr523

关键词

-

资金

  1. National Science Foundation [IIS 0916463]
  2. Department of Energy's Office of Biological and Environmental Research and Office of Advanced Scientific Computing Research [57271, 54976]
  3. Direct For Computer & Info Scie & Enginr
  4. Div Of Information & Intelligent Systems [0916463] Funding Source: National Science Foundation

向作者/读者索取更多资源

A MapReduce-based implementation called MRMSPolygraph for parallelizing peptide identification from mass spectrometry data is presented. The underlying serial method, MSPolygraph, uses a novel hybrid approach to match an experimental spectrum against a combination of a protein sequence database and a spectral library. Our MapReduce implementation can run on any Hadoop cluster environment. Experimental results demonstrate that, relative to the serial version, MR-MSPolygraph reduces the time to solution from weeks to hours, for processing tens of thousands of experimental spectra. Speedup and other related performance studies are also reported on a 400-core Hadoop cluster using spectral datasets from environmental microbial communities as inputs.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据