4.7 Article

Large scale analysis of MASCOT results using a mass accuracy-based THreshold (MATH) effectively improves data interpretation

Journal

JOURNAL OF PROTEOME RESEARCH
Volume 4, Issue 4, Pages 1353-1360

Publisher

AMER CHEMICAL SOC
DOI: 10.1021/pr0500509

Keywords

bioinformatics; MASCOT; data analysis; search algorithms; statistics; IEF/LC-MS/MS; SEQUEST; data standards; biomarkers

Funding

  1. NCI NIH HHS [CA 107988, CA 103086] Funding Source: Medline
  2. NCRR NIH HHS [RR 21239] Funding Source: Medline

Ask authors/readers for more resources

In this report, we take a heuristic approach to studying the effects of mass tolerance settings and database size on the sensitivity and specificity of MASCOT. We also examine the efficacy of the MASCOT Identity Threshold as a discriminator when applied to QqTOF data with an average mass accuracy of 10 ppm or better. As predicted, arbitrarily large mass tolerance settings negatively affect MASCOT's specificity, and to a lesser degree, sensitivity. Increased mass tolerances also render the generation of a significance threshold less effective. To study these effects, we used Bayes' Law to calculate MASCOT's predictive values. With a relatively small search database (Human IPI), MASCOT had a mean positive predictive value of 0.993 when combined with MASCOT's Identity Threshold. However, the corresponding average negative predictive value, or the probability that an ion was not present given no score or a score below threshold, was reduced as mass tolerances were tightened, and had an average value of 0.717. This value was improved upon by extrapolating an empirical threshold using a reversed database search and a new algorithm to rapidly identify false positive identifications. Using the empirical threshold reduced false negative identifications on the average 17% while limiting the false positive rate to below 5%; even larger reductions were obtained using mass tolerances approaching two times the actual error of the experimental data. A simple application of this strategy to the analysis of a microdissected glioblastoma multiforme sample analyzed by IEF/LC-MS/MS is reported, as is a description of the tools required to implement a large scale analysis using this alternative approach.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available