Article

Normalizing flows for conditional independence testing

KNOWLEDGE AND INFORMATION SYSTEMS (2023)

Journal

KNOWLEDGE AND INFORMATION SYSTEMS

Volume -, Issue -, Pages -

Publisher

SPRINGER LONDON LTD

DOI: 10.1007/s10115-023-01964

Keywords

Conditional independence; Hypothesis testing; Representation learning; Generative models; Normalizing flows; Mixed data

Categories

Computer Science, Artificial Intelligence; Computer Science, Information Systems

Summary

This study introduces LCIT (Latent representation-based Conditional Independence Test), a novel method for testing conditional independence based on representation learning. LCIT first learns to infer latent representations of the target variables X and Y that contain no information about the conditioning variable Z, and then checks the latent variables for any significant remaining dependence using a conventional correlation test. LCIT consistently outperforms several state-of-the-art baselines and adapts well to nonlinear, high-dimensional, and mixed data settings on a diverse collection of synthetic and real data sets.

Abstract

Detecting conditional independencies plays a key role in several statistical and machine learning tasks, especially in causal discovery algorithms, yet it remains a highly challenging problem due to dimensionality and the complex relationships present in data. In this study, we introduce LCIT (Latent representation-based Conditional Independence Test), a novel method for conditional independence testing based on representation learning. Our main contribution is a hypothesis testing framework in which, to test for independence between X and Y given Z, we first learn to infer latent representations of the target variables X and Y that contain no information about the conditioning variable Z. The latent variables are then examined for any significant remaining dependence, which can be done with a conventional correlation test. Moreover, LCIT can handle discrete and mixed-type data by converting discrete variables into the continuous domain via variational dequantization. The empirical evaluations show that LCIT consistently outperforms several state-of-the-art baselines under different evaluation metrics and adapts well to nonlinear, high-dimensional, and mixed data settings on a diverse collection of synthetic and real data sets.
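The two-step structure described above (first strip the influence of Z from X and Y, then run a correlation test on what remains) can be illustrated with a deliberately simplified sketch. This is not the authors' implementation: the conditional normalizing flows are replaced here by a linear-Gaussian stand-in (least-squares residuals), and variational dequantization is replaced by plain uniform dequantization. The function names `gaussianize`, `fisher_z_pvalue`, `latent_ci_test`, and `dequantize` are hypothetical and chosen only for illustration.

```python
import math

import numpy as np


def dequantize(d, rng):
    """Uniform dequantization (a simpler stand-in for variational
    dequantization): add U[0, 1) noise to integer-coded values."""
    d = np.asarray(d, dtype=float)
    return d + rng.uniform(0.0, 1.0, size=d.shape)


def gaussianize(target, z):
    """ASSUMPTION: stand-in for a conditional flow. Removes the part of
    `target` that is linearly explained by z and rescales the residual."""
    design = np.column_stack([np.ones(len(target)), z])
    coef, *_ = np.linalg.lstsq(design, target, rcond=None)
    resid = target - design @ coef
    return resid / resid.std()


def fisher_z_pvalue(a, b, n_cond):
    """Two-sided p-value of a Fisher z correlation test between a and b,
    with degrees of freedom adjusted for n_cond conditioning variables."""
    r = float(np.clip(np.corrcoef(a, b)[0, 1], -0.999999, 0.999999))
    stat = math.sqrt(len(a) - n_cond - 3) * math.atanh(r)
    # For a standard normal statistic, 2 * (1 - Phi(|s|)) = erfc(|s| / sqrt(2)).
    return math.erfc(abs(stat) / math.sqrt(2.0))


def latent_ci_test(x, y, z):
    """p-value for H0: X independent of Y given Z (linear-Gaussian sketch)."""
    z = np.asarray(z, dtype=float).reshape(len(x), -1)
    ex = gaussianize(np.asarray(x, dtype=float), z)
    ey = gaussianize(np.asarray(y, dtype=float), z)
    return fisher_z_pvalue(ex, ey, n_cond=z.shape[1])


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    z = rng.normal(size=(2000, 1))
    x = 2.0 * z[:, 0] + rng.normal(size=2000)
    y_indep = -z[:, 0] + rng.normal(size=2000)  # X and Y share only Z
    y_dep = y_indep + x                         # direct X -> Y link
    print("independent given Z:", latent_ci_test(x, y_indep, z))
    print("dependent given Z:  ", latent_ci_test(x, y_dep, z))
```

A small p-value suggests rejecting conditional independence. Because the stand-in only removes linear effects of Z, it would miss the nonlinear relationships that the flow-based representation in the paper is designed to capture.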
