☆ 4.6 Article

OPTIMAL FALSE DISCOVERY RATE CONTROL FOR LARGE SCALE MULTIPLE TESTING WITH AUXILIARY INFORMATION

ANNALS OF STATISTICS (2022)

期刊

ANNALS OF STATISTICS

卷 50, 期 2, 页码 807-857

出版社

INST MATHEMATICAL STATISTICS-IMS

DOI: 10.1214/21-AOS2128

关键词

EM algorithm; false discovery rate; isotonic regression; local false discovery rate; multiple testing; Pool-Adjacent-Violators algorithm

类别

Statistics & Probability

资金

NIH [2UL1TR001427-05]
Mayo Clinic Center for Individualized Medicine
NSF [DMS-1830392, DMS1811747]
National Institutes of Health [R21 HG011662]

向作者/读者索取更多资源

Protocol

社区支持

Reagent

社区支持

智能总结 New
摘要

The article introduces a method to improve the statistical power of large-scale multiple testing by utilizing auxiliary information in high-dimensional statistical inference. By using a framework based on a two-group mixture model and imposing structural relationship constraints and an optimal rejection rule to control the false discovery rate, the method's power is enhanced. The advantages of the proposed method are verified through empirical and theoretical analysis.

Large-scale multiple testing is a fundamental problem in high dimensional statistical inference. It is increasingly common that various types of auxiliary information, reflecting the structural relationship among the hypotheses, are available. Exploiting such auxiliary information can boost statistical power. To this end, we propose a framework based on a two-group mixture model with varying probabilities of being null for different hypotheses a priori, where a shape-constrained relationship is imposed between the auxiliary information and the prior probabilities of being null. An optimal rejection rule is designed to maximize the expected number of true positives when average false discovery rate is controlled. Focusing on the ordered structure, we develop a robust EM algorithm to estimate the prior probabilities of being null and the distribution of p-values under the alternative hypothesis simultaneously. We show that the proposed method has better power than state-of-the-art competitors while controlling the false discovery rate, both empirically and theoretically. Extensive simulations demonstrate the advantage of the proposed method. Datasets from genome-wide association studies are used to illustrate the new methodology.

OPTIMAL FALSE DISCOVERY RATE CONTROL FOR LARGE SCALE MULTIPLE TESTING WITH AUXILIARY INFORMATION

期刊

ANNALS OF STATISTICS

出版社

INST MATHEMATICAL STATISTICS-IMS

关键词

类别

资金

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

OPTIMAL FALSE DISCOVERY RATE CONTROL FOR LARGE SCALE MULTIPLE TESTING WITH AUXILIARY INFORMATION

期刊

ANNALS OF STATISTICS

出版社

INST MATHEMATICAL STATISTICS-IMS

关键词

类别

资金

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

导出引文

分享论文