4.8 Article

Recognizing millions of consistently unidentified spectra across hundreds of shotgun proteomics datasets

期刊

NATURE METHODS
卷 13, 期 8, 页码 651-+

出版社

NATURE PUBLISHING GROUP
DOI: 10.1038/NMETH.3902

关键词

-

资金

  1. Vienna Science and Technology Fund (WWTF) [LS11-045]
  2. Wellcome Trust [WT101477MA]
  3. BBSRC [BB/K01997X/1, BB/I00095X/1]
  4. Deutsche Forschungsgemeinschaft [SFB685]
  5. BMBF [01ZX1301F]
  6. BBSRC [BB/I000909/1, BB/K01997X/1, BB/K020145/1, BB/I00095X/1] Funding Source: UKRI
  7. Biotechnology and Biological Sciences Research Council [BB/K020145/1, BB/K01997X/1, BB/I00095X/1, BB/I000909/1] Funding Source: researchfish

向作者/读者索取更多资源

Mass spectrometry (MS) is the main technology used in proteomics approaches. However, on average, 75% of spectra analyzed in an MS experiment remain unidentified. We propose to use spectrum clustering at a large scale to shed light on these unidentified spectra. The Proteomics Identifications (PRIDE) Database Archive is one of the largest MS proteomics public data repositories worldwide. By clustering all tandem MS spectra publicly available in the PRIDE Archive, coming from hundreds of data sets, we were able to consistently characterize spectra into three distinct groups: (1) incorrectly identified, (2) correctly identified but below the set scoring threshold, and (3) truly unidentified. Using multiple complementary analysis approaches, we were able to identify similar to 20% of the consistently unidentified spectra. The complete spectrum-clustering results are available through the new version of the PRIDE Cluster resource (http://www.ebi.ac.uk/pride/cluster). This resource is intended, among other aims, to encourage and simplify further investigation into these unidentified spectra.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.8
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据