期刊
ANNALS OF APPLIED STATISTICS
卷 12, 期 2, 页码 1180-1203出版社
INST MATHEMATICAL STATISTICS
DOI: 10.1214/17-AOAS1083
关键词
Differential correlation mining; association mining; biostatistics; genomics; high-dimensional data
资金
- NSF [DGE-1144081, DMS-1127914, DMS-1309619, DMS-1613112, IIS-1633212, DMS-1613072, DMS-1310002]
- NIH [R01 HG009125-01, R01 MH101819-01]
- Division Of Mathematical Sciences
- Direct For Mathematical & Physical Scien [1613112] Funding Source: National Science Foundation
Given data obtained under two sampling conditions, it is often of interest to identify variables that behave differently in one condition than in the other. We introduce a method for differential analysis of second-order behavior called Differential Correlation Mining (DCM). The DCM method identifies differentially correlated sets of variables, with the property that the average pairwise correlation between variables in a set is higher under one sample condition than the other. DCM is based on an iterative search procedure that adaptively updates the size and elements of a candidate variable set. Updates are performed via hypothesis testing of individual variables, based on the asymptotic distribution of their average differential correlation. We investigate the performance of DCM by applying it to simulated data as well as to recent experimental datasets in genomics and brain imaging.
作者
我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。
推荐
暂无数据