Journal
PROTEOMICS
Volume 9, Issue 17, Pages 4176-4191Publisher
WILEY
DOI: 10.1002/pmic.200800502
Keywords
Bioinformatics; Cancer diagnosis; Machine learning; MS; Peak detection
Funding
- Leverhulme Trust Early Career Fellowship [ECF/2007/0433]
- NERC [NER/J/S/2002/006/8]
- BBSRC [BB/F016298/1]
- Biotechnology and Biological Sciences Research Council [BB/F016298/1] Funding Source: researchfish
- BBSRC [BB/F016298/1] Funding Source: UKRI
Ask authors/readers for more resources
This paper proposes a novel profiling method for SELDI-TOF and MALDI-TOF MS data that integrates a novel peak detection method based on modified smoothed non-linear energy operator, correlation-based peak selection and Bayesian additive regression trees. The peak detection and classification performance of the proposed approach is validated on two publicly available MS data sets, namely MALDI-TOF simulation data and high-resolution SELDI-TOF ovarian cancer data. The results compared favorably with three state-of-the-art peak detection algorithms and four machine-learning algorithms. For the high-resolution ovarian cancer data set, seven biomarkers (m/z windows) were found by our method, which achieved 97.30 and 99.10% accuracy at 25th and 75th percentiles, respectively, from 50 independent cross-validation samples, which is significantly better than other profiling and dimensional reduction methods. The results show that the method is capable of finding parsimonious sets of biologically meaningful biomarkers with better accuracy than existing methods. Supporting Information material and MATLAB/R scripts to implement the methods described in the article are available at: http://www.cs.bham.ac.uk/szh/Source-Codefor-Proteomics.zip
Authors
I am an author on this paper
Click your name to claim this paper and add it to your profile.
Reviews
Recommended
No Data Available