4.7 Article

Challenging Targets or Describing Mismatches? A Comment on Common Decoy Distribution by Madej et al.

期刊

JOURNAL OF PROTEOME RESEARCH
卷 21, 期 12, 页码 2840-2845

出版社

AMER CHEMICAL SOC
DOI: 10.1021/acs.jproteome.2c00279

关键词

common decoy distribution; false discovery rate; peptide-spectrum match; shotgun proteomics; target-decoy competition

资金

  1. French National Research Agency: ProFI project [ANR-10-INBS-08]
  2. GRAL project [ANR-10-LABX49-01]
  3. MIAI @ Grenoble Alpes [ANR-19-P3IA-0003]

向作者/读者索取更多资源

In their recent article, Madej et al. proposed a new method to control false discovery rate (FDR) in peptide-spectrum-match (PSM) validation. The method, called Common Decoy Distribution (CDD), uses a precise distribution of decoy matches to control FDR during a target-only search. It combines the advantages of decoy-based approaches and decoy-free approaches, and provides practical insights for improving FDR control methods in proteomics.
In their recent article, Madej et al. (Madej, D.; Wu, L.; Lam, H.Common Decoy Distributions Simplify False Discovery Rate Estimation in Shotgun Proteomics. J. Proteome Res.2022, 21 (2), 339-348) proposed an original way to solve the recurrent issue of controlling for the false discovery rate (FDR) in peptide-spectrum-match (PSM) validation. Briefly, they proposed to derive a single precise distribution of decoy matches termed the Common Decoy Distribution (CDD) and to use it to control for FDR during a target-only search. Conceptually, this approach is appealing as it takes the best of two worlds, i.e., decoy-based approaches (which leverage a large-scale collection of empirical mismatches) and decoy-free approaches (which are not subject to the randomness of decoy generation while sparing an additional database search). Interestingly, CDD also corresponds to a middle-of-the-road approach in statistics with respect to the two main families of FDR control procedures: Although historically based on estimating the false-positive distribution, FDR control has recently been demonstrated to be possible thanks to competition between the original variables (in proteomics, target sequences) and their fictional counterparts (in proteomics, decoys). Discriminating between these two theoretical trends is of prime importance for computational proteomics. In addition to highlighting why proteomics was a source of inspiration for theoretical biostatistics, it provides practical insights into the improvements that can be made to FDR control methods used in proteomics, including CDD.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据