4.6 Article

Estimating False Discovery Proportion Under Arbitrary Covariance Dependence

期刊

JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION
卷 107, 期 499, 页码 1019-1035

出版社

TAYLOR & FRANCIS INC
DOI: 10.1080/01621459.2012.720478

关键词

Arbitrary dependence structure; False discovery rate; Genome-wide association studies; High-dimensional inference; Multiple hypothesis testing

资金

  1. NSF [DMS-0704337, DMS-0714554]
  2. NIH [R01-GM072611]
  3. Direct For Mathematical & Physical Scien
  4. Division Of Mathematical Sciences [1206464] Funding Source: National Science Foundation

向作者/读者索取更多资源

Multiple hypothesis testing is a fundamental problem in high-dimensional inference, with wide applications in many scientific fields. In genome-wide association studies, tens of thousands of tests are performed simultaneously to find if any single-nucleotide polymorphisms (SNPs) are associated with some traits and those tests are correlated. When test statistics are correlated, false discovery control becomes very challenging under arbitrary dependence. In this article, we propose a novel method-based on principal factor approximation-that successfully subtracts the common dependence and weakens significantly the correlation structure, to deal with an arbitrary dependence structure. We derive an approximate expression for false discovery proportion (FDP) in large-scale multiple testing when a common threshold is used and provide a consistent estimate of realized FDP. This result has important applications in controlling false discovery rate and FDP. Our estimate of realized FDP compares favorably with Efron's approach, as demonstrated in the simulated examples. Our approach is further illustrated by some real data applications. We also propose a dependence-adjusted procedure that is more powerful than the fixed-threshold procedure. Supplementary material for this article is available online.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.6
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据