4.5 Article

False discovery rate-controlled multiple testing for union null hypotheses: a knockoff-based approach

期刊

BIOMETRICS
卷 -, 期 -, 页码 -

出版社

WILEY
DOI: 10.1111/biom.13848

关键词

FDR control; heterogeneity; replicability; reproducibility; variable selection

向作者/读者索取更多资源

False discovery rate (FDR) controlling procedures ensure statistical guarantees for replicability in signal identification based on multiple hypotheses testing. This paper introduces a knockoff-based variable selection method (Simultaneous knockoffs) that provides exact FDR control guarantees under finite sample settings. This method identifies mutual signals from multiple independent datasets by jointly considering information from different sources (with potential heterogeneity).
False discovery rate (FDR) controlling procedures provide important statistical guarantees for replicability in signal identification based on multiple hypotheses testing. In many fields of study, FDR controling procedures are used in high-dimensional (HD) analyses to discover features that are truly associated with the outcome. In some recent applications, data on the same set of candidate features are independently collected in multiple different studies. For example, gene expression data are collected at different facilities and with different cohorts, to identify the genetic biomarkers of multiple types of cancers. These studies provide us with opportunities to identify signals by considering information from different sources (with potential heterogeneity) jointly. This paper is about how to provide FDR control guarantees for the tests of union null hypotheses of conditional independence. We present a knockoff-based variable selection method (Simultaneous knockoffs) to identify mutual signals from multiple independent datasets, providing exact FDR control guarantees under finite sample settings. This method can work with very general model settings and test statistics. We demonstrate the performance of this method with extensive numerical studies and two real-data examples.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.5
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据