4.8 Article

Method for Assessing the Statistical Significance of Mass Spectral Similarities Using Basic Local Alignment Search Tool Statistics

Journal

ANALYTICAL CHEMISTRY
Volume 85, Issue 17, Pages 8291-8297

Publisher

AMER CHEMICAL SOC
DOI: 10.1021/ac401564v

Keywords

-

Funding

  1. JST
  2. Strategic International Collaborative Research Program
  3. SICORP for JP-US metabolomics
  4. Grants in Aid for Scientific Research (B)
  5. Grants-in-Aid for Scientific Research [25820400, 25871136] Funding Source: KAKEN

Ask authors/readers for more resources

A novel method for assessing the statistical significance of mass spectral similarities was developed using modified basic local alignment search tool (BLAST; Karlin- Altschul) statistics. In gas chromatography/mass spectrometry-based metabolomics, many signals in raw metabolome data are identified on the basis of unexpected similarities among mass spectra and the spectra of standards. Since there is inevitably noise in the observed spectra, a list of identified metabolites includes some false positives. In the developed method, electron ionization (El) mass spectrometry-BLAST, a similarity score of two mass spectra is calculated using a general scoring scheme, from which the probability of obtaining the score by chance (P value) is calculated. For this purpose, a simple rule for converting a unit El mass spectrum to a mass spectral sequence as well as a score matrix for aligned mass spectral sequences was developed. A Monte Carlo simulation using randomly generated mass spectral sequences demonstrated that the null distribution or the expected number of hits (E value) follows modified Karlin-Altschul statistics. A metabolite data set obtained from green tea extract was analyzed using the developed method. Among 171 metabolite signals in the metabolome data, 93 signals were identified on the basis of significant similarities (P < 0.015) with reference data. Since the expected number of false positives is 2.6, the false discovery rate was estimated to be 2.8%, indicating that the search threshold (P < 0.015) is reasonable for metabolite identification.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.8
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available