4.7 Article

Separating the wheat from the chaff: unbiased filtering of background tandem mass spectra improves protein identification

期刊

JOURNAL OF PROTEOME RESEARCH
卷 7, 期 8, 页码 3382-3395

出版社

AMER CHEMICAL SOC
DOI: 10.1021/pr800140v

关键词

proteomics; LC-MS/MS; sequence similarity searches; background spectra filtering; de novo sequencing; MS BLAST

资金

  1. NIGMS NIH HHS [R01 GM070986, R01 GM070986-04, 1 R01 GM 070986-01A1] Funding Source: Medline

向作者/读者索取更多资源

Only a small fraction of spectra acquired in LC-MS/MS runs matches peptides from target proteins upon database searches. The remaining, operationally termed background, spectra originate from a variety of poorly controlled sources and affect the throughput and confidence of database searches. Here, we report an algorithm and its software implementation that rapidly removes background spectra, regardless of their precise origin. The method estimates the dissimilarity distance between screened MS/MS spectra and unannotated spectra from a partially redundant background library compiled from several control and blank runs. Filtering MS/MS queries enhanced the protein identification capacity when searches lacked spectrum to sequence matching specificity. In sequence-similarity searches it reduced by, on average, 30-fold the number of orphan hits, which were not explicitly related to background protein contaminants and required manual validation. Removing high quality background MS/MS spectra, while preserving in the data set the genuine spectra from target proteins, decreased the false positive rate of stringent database searches and improved the identification of low-abundance proteins.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据