4.7 Article

Challenging Targets or Describing Mismatches? A Comment on Common Decoy Distribution by Madej et al.

Journal

JOURNAL OF PROTEOME RESEARCH
Volume 21, Issue 12, Pages 2840-2845

Publisher

AMER CHEMICAL SOC
DOI: 10.1021/acs.jproteome.2c00279

Keywords

common decoy distribution; false discovery rate; peptide-spectrum match; shotgun proteomics; target-decoy competition

Funding

  1. French National Research Agency: ProFI project [ANR-10-INBS-08]
  2. GRAL project [ANR-10-LABX49-01]
  3. MIAI @ Grenoble Alpes [ANR-19-P3IA-0003]

Ask authors/readers for more resources

In their recent article, Madej et al. proposed a new method to control false discovery rate (FDR) in peptide-spectrum-match (PSM) validation. The method, called Common Decoy Distribution (CDD), uses a precise distribution of decoy matches to control FDR during a target-only search. It combines the advantages of decoy-based approaches and decoy-free approaches, and provides practical insights for improving FDR control methods in proteomics.
In their recent article, Madej et al. (Madej, D.; Wu, L.; Lam, H.Common Decoy Distributions Simplify False Discovery Rate Estimation in Shotgun Proteomics. J. Proteome Res.2022, 21 (2), 339-348) proposed an original way to solve the recurrent issue of controlling for the false discovery rate (FDR) in peptide-spectrum-match (PSM) validation. Briefly, they proposed to derive a single precise distribution of decoy matches termed the Common Decoy Distribution (CDD) and to use it to control for FDR during a target-only search. Conceptually, this approach is appealing as it takes the best of two worlds, i.e., decoy-based approaches (which leverage a large-scale collection of empirical mismatches) and decoy-free approaches (which are not subject to the randomness of decoy generation while sparing an additional database search). Interestingly, CDD also corresponds to a middle-of-the-road approach in statistics with respect to the two main families of FDR control procedures: Although historically based on estimating the false-positive distribution, FDR control has recently been demonstrated to be possible thanks to competition between the original variables (in proteomics, target sequences) and their fictional counterparts (in proteomics, decoys). Discriminating between these two theoretical trends is of prime importance for computational proteomics. In addition to highlighting why proteomics was a source of inspiration for theoretical biostatistics, it provides practical insights into the improvements that can be made to FDR control methods used in proteomics, including CDD.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available