Large scale analysis of MASCOT results using a mass accuracy-based THreshold (MATH) effectively improves data interpretation

JOURNAL OF PROTEOME RESEARCH (2005)

Journal

JOURNAL OF PROTEOME RESEARCH

Volume 4, Issue 4, Pages 1353-1360

Publisher

AMER CHEMICAL SOC

DOI: 10.1021/pr0500509

Keywords

bioinformatics; MASCOT; data analysis; search algorithms; statistics; IEF/LC-MS/MS; SEQUEST; data standards; biomarkers

Funding

NCI NIH HHS [CA 107988, CA 103086] Funding Source: Medline
NCRR NIH HHS [RR 21239] Funding Source: Medline

Abstract

In this report, we take a heuristic approach to studying the effects of mass tolerance settings and database size on the sensitivity and specificity of MASCOT. We also examine the efficacy of the MASCOT Identity Threshold as a discriminator when applied to QqTOF data with an average mass accuracy of 10 ppm or better. As predicted, arbitrarily large mass tolerance settings negatively affect MASCOT's specificity and, to a lesser degree, its sensitivity. Increased mass tolerances also render the generation of a significance threshold less effective. To study these effects, we used Bayes' Law to calculate MASCOT's predictive values. With a relatively small search database (Human IPI), MASCOT had a mean positive predictive value of 0.993 when combined with MASCOT's Identity Threshold. However, the corresponding negative predictive value, that is, the probability that an ion was not present given no score or a score below threshold, was reduced as mass tolerances were tightened, and had an average value of 0.717. This value was improved upon by extrapolating an empirical threshold using a reversed database search and a new algorithm to rapidly identify false positive identifications. Using the empirical threshold reduced false negative identifications by 17% on average while limiting the false positive rate to below 5%; even larger reductions were obtained using mass tolerances approaching two times the actual error of the experimental data. We report a simple application of this strategy to a microdissected glioblastoma multiforme sample analyzed by IEF/LC-MS/MS, along with a description of the tools required to implement a large scale analysis using this alternative approach.
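The predictive values mentioned in the abstract follow from Bayes' Law once sensitivity, specificity, and the prior probability that an ion is truly present are known. The sketch below is a minimal illustration of that textbook relationship, not the authors' implementation: the function name `predictive_values`, the `prevalence` parameter, and the input numbers in the usage lines are assumptions made for this example and are not taken from the paper.

```python
from typing import Tuple


def predictive_values(
    sensitivity: float, specificity: float, prevalence: float
) -> Tuple[float, float]:
    """Return (PPV, NPV) via Bayes' Law.

    sensitivity: P(score above threshold | ion present)
    specificity: P(score below threshold | ion absent)
    prevalence:  P(ion present), the prior (hypothetical input)
    """
    for name, value in (
        ("sensitivity", sensitivity),
        ("specificity", specificity),
        ("prevalence", prevalence),
    ):
        if not 0.0 <= value <= 1.0:
            raise ValueError(f"{name} must lie in [0, 1], got {value}")

    # Joint probabilities of the four outcomes.
    true_pos = sensitivity * prevalence
    false_pos = (1.0 - specificity) * (1.0 - prevalence)
    true_neg = specificity * (1.0 - prevalence)
    false_neg = (1.0 - sensitivity) * prevalence

    if true_pos + false_pos == 0.0 or true_neg + false_neg == 0.0:
        raise ValueError("predictive value undefined for these inputs")

    # PPV = P(ion present | above threshold)
    ppv = true_pos / (true_pos + false_pos)
    # NPV = P(ion absent | no score or below threshold)
    npv = true_neg / (true_neg + false_neg)
    return ppv, npv


if __name__ == "__main__":
    # Illustrative inputs only; these are not values reported in the paper.
    ppv, npv = predictive_values(sensitivity=0.80, specificity=0.99, prevalence=0.50)
    print(f"PPV = {ppv:.3f}, NPV = {npv:.3f}")
```

With a highly specific threshold and modest sensitivity, the positive predictive value stays high while the negative predictive value drops, which is the same qualitative pattern the abstract describes for the Identity Threshold.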
