☆ 4.6 Article

Correlated z-Values and the Accuracy of Large-Scale Statistical Estimates

JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION (2010)

Journal

JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION

Volume 105, Issue 491, Pages 1042-1055

Publisher

AMER STATISTICAL ASSOC

DOI: 10.1198/jasa.2010.tm09129

Keywords

Acceleration; Correlation penalty; Empirical process; Mehler's identity; Nonnull z-values; Rms correlation

Funding

NIH [8R01 EB002784]
NSF [DMS0505673]
Division Of Mathematical Sciences
Direct For Mathematical & Physical Scien [0854973] Funding Source: National Science Foundation

Ask authors/readers for more resources

Protocol

Community support

Reagent

Community support

Abstract

We consider large-scale studies in which there are hundreds or thousands of correlated cases to investigate, each represented by its own normal variate, typically a z-value. A familiar example is provided by a microarray experiment comparing healthy with sick subjects' expression levels for thousands of genes. This paper concerns the accuracy of summary statistics for the collection of normal variates, such as their empirical cdf or a false discovery rate statistic. It seems like we must estimate an N by N correlation matrix, N the number of cases, but our main result shows that this is not necessary: good accuracy approximations can be based on the root mean square correlation over all N . (N - 1)/2 pairs, a quantity often easily estimated. A second result shows that z-values closely follow normal distributions even under nonnull conditions, supporting application of the main theorem. Practical application of the theory is illustrated for a large leukemia microarray study.

Correlated z-Values and the Accuracy of Large-Scale Statistical Estimates

Journal

JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION

Publisher

AMER STATISTICAL ASSOC

Keywords

Categories

Funding

Ask authors/readers for more resources

Protocol

Reagent

Authors

I am an author on this paper

Reviews

Primary Rating

Secondary Ratings

Novelty

Significance

Scientific rigor

Rate this paper

Recommended

Correlated z-Values and the Accuracy of Large-Scale Statistical Estimates

Journal

JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION

Publisher

AMER STATISTICAL ASSOC

Keywords

Categories

Funding

Ask authors/readers for more resources

Protocol

Reagent

Authors

I am an author on this paper

Reviews

Primary Rating

Secondary Ratings

Novelty

Significance

Scientific rigor

Rate this paper

Recommended

Export Citation

Share Paper