4.7 Article

Improved quality control processing of peptide-centric LC-MS proteomics data

期刊

BIOINFORMATICS
卷 27, 期 20, 页码 2866-2872

出版社

OXFORD UNIV PRESS
DOI: 10.1093/bioinformatics/btr479

关键词

-

资金

  1. National Institutes of Health
  2. National Institute of General Medical Sciences [R01 GM084892]
  3. National Institute of Environmental Health Sciences [U54 ES 016015]
  4. National Institute of Allergy and Infectious Disease [HHSN272200800060C]
  5. DOE [DE-AC05-76RLO1830]

向作者/读者索取更多资源

Motivation: In the analysis of differential peptide peak intensities (i.e. abundance measures), LC-MS analyses with poor quality peptide abundance data can bias downstream statistical analyses and hence the biological interpretation for an otherwise high-quality dataset. Although considerable effort has been placed on assuring the quality of the peptide identification with respect to spectral processing, to date quality assessment of the subsequent peptide abundance data matrix has been limited to a subjective visual inspection of run-by-run correlation or individual peptide components. Identifying statistical outliers is a critical step in the processing of proteomics data as many of the downstream statistical analyses [e.g. analysis of variance (ANOVA)] rely upon accurate estimates of sample variance, and their results are influenced by extreme values. Results: We describe a novel multivariate statistical strategy for the identification of LC-MS runs with extreme peptide abundance distributions. Comparison with current method (run-by-run correlation) demonstrates a significantly better rate of identification of outlier runs by the multivariate strategy. Simulation studies also suggest that this strategy significantly outperforms correlation alone in the identification of statistically extreme liquid chromatography-mass spectrometry (LC-MS) runs.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据