☆ 4.5 Article

Estimating false discovery rates for peptide and protein identification using randomized databases

PROTEOMICS (2010)

Journal

PROTEOMICS

Volume 10, Issue 12, Pages 2369-2376

Publisher

WILEY

DOI: 10.1002/pmic.200900619

Keywords

Bioinformatics; False discovery rates; Peptide identification; Protein identification; Randomized databases

Funding

NIH [5R01 GM076680-02, UO1 DK072473]
NSF [DBI-0544757, NSF-07140]
McMillen Foundation
University of Washington's Proteomics Resource [UWPR95794]

Ask authors/readers for more resources

Protocol

Community support

Reagent

Community support

Abstract

MS-based proteomics characterizes protein contents of biological samples. The most common approach is to first match observed MS/MS peptide spectra against theoretical spectra from a protein sequence database and then to score these matches. The false discovery rate (FDR) can be estimated as a function of the score by searching together the protein sequence database and its randomized version and comparing the score distributions of the randomized versus nonrandomized matches. This work introduces a straightforward isotonic regression-based method to estimate the cumulative FDRs and local FDRs (LFDRs) of peptide identification. Our isotonic method not only performed as well as other methods used for comparison, but also has the advantages of being: (i) monotonic in the score, (ii) computationally simple, and (iii) not dependent on assumptions about score distributions. We demonstrate the flexibility of our approach by using it to estimate FDRs and LFDRs for protein identification using summaries of the peptide spectra scores. We reconfirmed that several of these methods were superior to a two-peptide rule. Finally, by estimating both the FDRs and LFDRs, we showed for both peptide and protein identification, moderate FDR values (5%) corresponded to large LFDR values (53 and 60%).

Estimating false discovery rates for peptide and protein identification using randomized databases

Journal

PROTEOMICS

Publisher

WILEY

Keywords

Categories

Funding

Ask authors/readers for more resources

Protocol

Reagent

Authors

I am an author on this paper

Reviews

Primary Rating

Secondary Ratings

Novelty

Significance

Scientific rigor

Rate this paper

Recommended

Estimating false discovery rates for peptide and protein identification using randomized databases

Journal

PROTEOMICS

Publisher

WILEY

Keywords

Categories

Funding

Ask authors/readers for more resources

Protocol

Reagent

Authors

I am an author on this paper

Reviews

Primary Rating

Secondary Ratings

Novelty

Significance

Scientific rigor

Rate this paper

Recommended

Export Citation

Share Paper