4.7 Article

Profiling the Human Phosphoproteome to Estimate the True Extent of Protein Phosphorylation

期刊

JOURNAL OF PROTEOME RESEARCH
卷 21, 期 6, 页码 1510-1524

出版社

AMER CHEMICAL SOC
DOI: 10.1021/acs.jproteome.2c00131

关键词

proteomics; database; phosphoproteomics; mass spectrometry; phosphorylation; phosphopeptides; phosphosites; PhosphoSitePlus; PeptideAtlas; proteome; false discovery rate; evolutionary conservation; UniProt

资金

  1. BBSRC [BB/S017054/1, BB/R000182/1, BB/M012557/1, BB/S018514/1]
  2. North West Cancer Research (NWCR) [CR1157, CR1088]
  3. Cancer Research UK [C1443/A22095]
  4. National Science Foundation [DBI-193331 1]
  5. National Institutes of Health [R01GM087221, R24GM127667]
  6. BBSRC [BB/R000182/1, BB/S017054/1, BB/S018514/1] Funding Source: UKRI

向作者/读者索取更多资源

This study profiles the human phosphoproteome to estimate the false discovery rate of phosphosites and predict a more realistic count of true identifications. It highlights significant potential of false-positive data in phosphorylation databases and emphasizes the importance of quality control for such data.
Public phosphorylation databases such as PhosphoSitePlus (PSP) and PeptideAtlas (PA) compile results from published papers or openly available mass spectrometry (MS) data. However, there is no database-level control for false discovery of sites, likely leading to the overestimation of true phosphosites. By profiling the human phosphoproteome, we estimate the false discovery rate (FDR) of phosphosites and predict a more realistic count of true identifications. We rank sites into phosphorylation likelihood sets and analyze them in terms of conservation across 100 species, sequence properties, and functional annotations. We demonstrate significant differences between the sets and develop a method for independent phosphosite FDR estimation. Remarkably, we report estimated FDRs of 84, 98, and 82% within sets of phosphoserine (pSer), phosphothreonine (pThr), and phosphotyrosine (pTyr) sites, respectively, that are supported by only a single piece of identification evidence-the majority of sites in PSP. We estimate that around 62 000 Ser, 8000 Thr, and 12 000 Tyr phosphosites in the human proteome are likely to be true, which is lower than most published estimates. Furthermore, our analysis estimates that 86 000 Ser, 50 000 Thr, and 26 000 Tyr phosphosites are likely false-positive identifications, highlighting the significant potential of false-positive data to be present in phosphorylation databases.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据