4.5 Article

Profiling MS proteomics data using smoothed non-linear energy operator and Bayesian additive regression trees

Journal

PROTEOMICS
Volume 9, Issue 17, Pages 4176-4191

Publisher

WILEY
DOI: 10.1002/pmic.200800502

Keywords

Bioinformatics; Cancer diagnosis; Machine learning; MS; Peak detection

Funding

  1. Leverhulme Trust Early Career Fellowship [ECF/2007/0433]
  2. NERC [NER/J/S/2002/006/8]
  3. BBSRC [BB/F016298/1]
  4. Biotechnology and Biological Sciences Research Council [BB/F016298/1] Funding Source: researchfish
  5. BBSRC [BB/F016298/1] Funding Source: UKRI

Ask authors/readers for more resources

This paper proposes a novel profiling method for SELDI-TOF and MALDI-TOF MS data that integrates a novel peak detection method based on modified smoothed non-linear energy operator, correlation-based peak selection and Bayesian additive regression trees. The peak detection and classification performance of the proposed approach is validated on two publicly available MS data sets, namely MALDI-TOF simulation data and high-resolution SELDI-TOF ovarian cancer data. The results compared favorably with three state-of-the-art peak detection algorithms and four machine-learning algorithms. For the high-resolution ovarian cancer data set, seven biomarkers (m/z windows) were found by our method, which achieved 97.30 and 99.10% accuracy at 25th and 75th percentiles, respectively, from 50 independent cross-validation samples, which is significantly better than other profiling and dimensional reduction methods. The results show that the method is capable of finding parsimonious sets of biologically meaningful biomarkers with better accuracy than existing methods. Supporting Information material and MATLAB/R scripts to implement the methods described in the article are available at: http://www.cs.bham.ac.uk/szh/Source-Codefor-Proteomics.zip

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.5
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available