4.5 Article

Comparative analysis of statistical methods used for detecting differential expression in label-free mass spectrometry proteomics

Journal

JOURNAL OF PROTEOMICS
Volume 129, Issue -, Pages 83-92

Publisher

ELSEVIER
DOI: 10.1016/j.jprot.2015.07.012

Keywords

Label-free mass spectrometry; Spectral counts; Differential expression; Statistical methodology

Funding

  1. Juvenile Diabetes Research Foundation [17-2011-658]
  2. Diabetes UK [12/0004530]
  3. Fondation Leducq (MIRVAD) [13 CVD 02]
  4. National Institute of Health Research Biomedical Research Center based at Guy's and St Thomas' National Health Service Foundation Trust
  5. King's College London
  6. King's College Hospital and an excellence initiative (Competence Centers for Excellent Technologies - COMET) of the Austrian Research Promotion Agency FFG: Research Center of Excellence in Vascular Ageing - Tyrol, VASCage - BMVIT [843536]
  7. BMWFW
  8. Wirtschaftsagentur Wien
  9. Standortagentur Tirol
  10. British Heart Foundation [FS/08/002/24537, FS/13/2/29892] Funding Source: researchfish

Ask authors/readers for more resources

Label-free LC-MS/MS proteomics has proven itself to be a powerful method for evaluating protein identification and quantification from complex samples. For comparative proteomics, several methods have been used to detect the differential expression of proteins from such data. We have assessed seven methods used across the literature for detecting differential expression from spectral count quantification: Student's t-test, significance analysis of micro-arrays (SAM), normalised spectral abundance factor (NSAF), normalised spectral abundance factor-power law global error model (NSAF-PLGEM), spectral index (Spl), DESeq and QSpec. We used 2000 simulated datasets as well as publicly available data from a proteomic standards study to assess the ability of these methods to detect differential expression in varying effect sizes and proportions of differentially expressed proteins. At two false discovery rate (FDR) levels, we find that several of the methods detect differential expression within the data with reasonable precision, others detect differential expression at the expense of low precision, and finally, others which fail to identify any differentially expressed proteins. The inability of these seven methods to fully capture the differential landscape, even at the largest effect size, illustrates some of the limitations of the existing technologies and the statistical methodologies. Significance: In label-free mass spectrometry experiments, protein identification and quantification have always been important but there is now a growing focus on comparative proteomics. Detecting differential expression in protein levels can inform on important biological mechanisms and provide direction for further study. Given the high cost and labour intensive nature of validation experiments, statistical methods are important for prioritising proteins of interest. Here, we have performed a comparative analysis to investigate the statistical methodologies for detecting differential expression and provide a reference for future experimental designs. This article is part of a Special Issue entitled: Computational Proteomics. (C) 2015 Elsevier B.V. All rights reserved.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.5
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available