4.5 Article

Tail-Robust Quantile Normalization

Journal

PROTEOMICS
Volume 20, Issue 24, Pages -

Publisher

WILEY
DOI: 10.1002/pmic.202000068

Keywords

missing values; normalization; PRIDE; proteomics; rank invariance

Funding

  1. German Ministry of Education and Research [FKZ031L0080]
  2. Deutsche Forschungsgemeinschaft (DFG, German Research Foundation) [CIBSS-EXC-2189-2100249960-390939984]

Ask authors/readers for more resources

High-throughput biological data-such as mass spectrometry (MS)-based proteomics data-suffer from systematic non-biological variance due to systematic errors. This hinders the estimation of real biological signals and, in turn, decreases the power of statistical tests and biases the identification of differentially expressed proteins. To remove such unintended variation, while retaining the biological signal of interest, analysis workflows for quantitative MS data typically comprise normalization prior to their statistical analysis. Several normalization methods, such as quantile normalization (QN), have originally been developed for microarray data. In contrast to microarray data proteomics data may contain features, in the form of protein intensities that are consistently high across experimental conditions and, hence, are encountered in the tails of the protein intensity distribution. If QN is applied in the presence of such proteins statistical inferences of the features' intensity profiles are impeded due to the biased estimation of their variance. A freely available, novel approach is introduced which serves as an improvement of the classical QN by preserving the biological signals of features in the tails of the intensity distribution and by accounting for sample-dependent missing values (MVs): The tail-robust quantile normalization (TRQN).

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.5
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available