4.6 Article

Optimal and maximin procedures for multiple testing problems

出版社

WILEY
DOI: 10.1111/rssb.12507

关键词

FDR; FWER; infinite linear programming; multiple comparisons; optimal testing; strong control

资金

  1. Israeli Science Foundation [2180/20]

向作者/读者索取更多资源

Multiple testing problems are important in modern statistical analysis, aiming to reject as many false null hypotheses as possible while controlling an overall measure of false discovery. This paper extends the optimal test for a single hypothesis to multiple testing problems and provides maximin rules for complex alternatives. The usefulness of these methods is demonstrated in numerical experiments and clinical trials, showing significant improvements in power.
Multiple testing problems (MTPs) are a staple of modern statistical analysis. The fundamental objective of MTPs is to reject as many false null hypotheses as possible (that is, maximize some notion of power), subject to controlling an overall measure of false discovery, like family-wise error rate (FWER) or false discovery rate (FDR). In this paper we provide generalizations to MTPs of the optimal Neyman-Pearson test for a single hypothesis. We show that for simple hypotheses, for both FWER and FDR and relevant notions of power, finding the optimal multiple testing procedure can be formulated as infinite dimensional binary programs and can in principle be solved for any number of hypotheses. We also characterize maximin rules for complex alternatives, and demonstrate that such rules can be found in practice, leading to improved practical procedures compared to existing alternatives that guarantee strong error control on the entire parameter space. We demonstrate the usefulness of these novel rules for identifying which studies contain signal in numerical experiments as well as in application to clinical trials with multiple studies. In various settings, the increase in power from using optimal and maximin procedures can range from 15% to more than 100%.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.6
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据