☆ 4.7 Article

Improved classification of mass spectrometry database search results using newer machine learning approaches

MOLECULAR & CELLULAR PROTEOMICS (2006)

Journal

MOLECULAR & CELLULAR PROTEOMICS

Volume 5, Issue 3, Pages 497-509

Publisher

ELSEVIER

DOI: 10.1074/mcp.M500233-MCP200

Keywords

Ask authors/readers for more resources

Protocol

Community support

Reagent

Community support

Abstract

Manual analysis of mass spectrometry data is a current bottleneck in high throughput proteomics. In particular, the need to manually validate the results of mass spectrometry database searching algorithms can be prohibitively time-consuming. Development of software tools that attempt to quantify the confidence in the assignment of a protein or peptide identity to a mass spectrum is an area of active interest. We sought to extend work in this area by investigating the potential of recent machine learning algorithms to improve the accuracy of these approaches and as a flexible framework for accommodating new data features. Specifically we demonstrated the ability of boosting and random forest approaches to improve the discrimination of true hits from false positive identifications in the results of mass spectrometry database search engines compared with thresholding and other machine learning approaches. We accommodated additional attributes obtainable from database search results, including a factor addressing proton mobility. Performance was evaluated using publically available electrospray data and a new collection of MALDI data generated from purified human reference proteins.

Improved classification of mass spectrometry database search results using newer machine learning approaches

Journal

MOLECULAR & CELLULAR PROTEOMICS

Publisher

ELSEVIER

Keywords

Categories

Ask authors/readers for more resources

Protocol

Reagent

Authors

I am an author on this paper

Reviews

Primary Rating

Secondary Ratings

Novelty

Significance

Scientific rigor

Rate this paper

Recommended

Improved classification of mass spectrometry database search results using newer machine learning approaches

Journal

MOLECULAR & CELLULAR PROTEOMICS

Publisher

ELSEVIER

Keywords

Categories

Ask authors/readers for more resources

Protocol

Reagent

Authors

I am an author on this paper

Reviews

Primary Rating

Secondary Ratings

Novelty

Significance

Scientific rigor

Rate this paper

Recommended

Export Citation

Share Paper