4.6 Article

A SIMPLE MEASURE OF CONDITIONAL DEPENDENCE

期刊

ANNALS OF STATISTICS
卷 49, 期 6, 页码 3070-3102

出版社

INST MATHEMATICAL STATISTICS-IMS
DOI: 10.1214/21-AOS2073

关键词

Conditional dependence; nonparametric measures of association; variable selection

资金

  1. NSF [DMS-1608249, DMS-1855484]

向作者/读者索取更多资源

This study introduces a coefficient for measuring the conditional dependence between random variables without distributional assumptions, and develops a new variable selection algorithm based on this coefficient. The method is model-free, parameter-free, and provably consistent under sparsity assumptions, with applications to both synthetic and real data sets.
We propose a coefficient of conditional dependence between two random variables Y and Z given a set of other variables X-1, ..., X-p, based on an i.i.d. sample. The coefficient has a long list of desirable properties, the most important of which is that under absolutely no distributional assumptions, it converges to a limit in [0, 1], where the limit is 0 if and only if Y and Z are conditionally independent given X-1, ..., X-p, and is 1 if and only if Y is equal to a measurable function of Z given X-1, ..., X-p. Moreover, it has a natural interpretation as a nonlinear generalization of the familiar partial R-2 statistic for measuring conditional dependence by regression. Using this statistic, we devise a new variable selection algorithm, called Feature Ordering by Conditional Independence (FOCI), which is model-free, has no tuning parameters, and is provably consistent under sparsity assumptions. A number of applications to synthetic and real data sets are worked out.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.6
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据