期刊
BIOINFORMATICS
卷 27, 期 21, 页码 3072-3073出版社
OXFORD UNIV PRESS
DOI: 10.1093/bioinformatics/btr523
关键词
-
类别
资金
- National Science Foundation [IIS 0916463]
- Department of Energy's Office of Biological and Environmental Research and Office of Advanced Scientific Computing Research [57271, 54976]
- Direct For Computer & Info Scie & Enginr
- Div Of Information & Intelligent Systems [0916463] Funding Source: National Science Foundation
A MapReduce-based implementation called MRMSPolygraph for parallelizing peptide identification from mass spectrometry data is presented. The underlying serial method, MSPolygraph, uses a novel hybrid approach to match an experimental spectrum against a combination of a protein sequence database and a spectral library. Our MapReduce implementation can run on any Hadoop cluster environment. Experimental results demonstrate that, relative to the serial version, MR-MSPolygraph reduces the time to solution from weeks to hours, for processing tens of thousands of experimental spectra. Speedup and other related performance studies are also reported on a 400-core Hadoop cluster using spectral datasets from environmental microbial communities as inputs.
作者
我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。
推荐
暂无数据