4.7 Article

Semisupervised model-based validation of peptide identifications in mass spectrometry-based proteomics

Journal

JOURNAL OF PROTEOME RESEARCH
Volume 7, Issue 1, Pages 254-265

Publisher

AMER CHEMICAL SOC
DOI: 10.1021/pr070542g

Keywords

mass spectrometry; peptide identification; protein sequence database searching; statistical validation; semisupervised modeling; decoy sequences

Funding

  1. NATIONAL CANCER INSTITUTE [R01CA126239] Funding Source: NIH RePORTER
  2. NATIONAL CENTER FOR RESEARCH RESOURCES [U54RR020843] Funding Source: NIH RePORTER
  3. NCI NIH HHS [CA-126239, R01 CA126239] Funding Source: Medline
  4. NCRR NIH HHS [U54 RR020843] Funding Source: Medline

Ask authors/readers for more resources

Development of robust statistical methods for validation of peptide assignments to tandem mass (MS/MS) spectra obtained using database searching remains an important problem. PeptideProphet is one of the commonly used computational tools available for that purpose. An alternative simple approach for validation of peptide assignments is based on addition of decoy (reversed, randomized, or shuffled) sequences to the searched protein sequence database. The probabilistic modeling approach of PeptideProphet and the decoy strategy can be combined within a single semisupervised framework, leading to improved robustness and higher accuracy of computed probabilities even in the case of most challenging data sets. We present a semisupervised expectation-maximization (EM) algorithm for constructing a Bayes classifier for peptide identification using the probability mixture model, extending PeptideProphet to incorporate decoy peptide matches. Using several data sets of varying complexity, from control protein mixtures to a human plasma sample, and using three commonly used database search programs, SEQUEST, MASCOT, and TANDEM/k-score, we illustrate that more accurate mixture estimation leads to an improved control of the false discovery rate in the classification of peptide assignments.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available