Article

LEARNING HIGH-DIMENSIONAL DIRECTED ACYCLIC GRAPHS WITH LATENT AND SELECTION VARIABLES

Journal

ANNALS OF STATISTICS
Volume 40, Issue 1, Pages 294-321

Publisher

INST MATHEMATICAL STATISTICS
DOI: 10.1214/11-AOS940

Keywords

Causal structure learning; FCI algorithm; RFCI algorithm; maximal ancestral graphs (MAGs); partial ancestral graphs (PAGs); high-dimensionality; sparsity; consistency

Funding

  1. Swiss National Science Foundation (SNF) [200021-129972]
  2. U.S. National Science Foundation, Directorate for Computer & Information Science & Engineering, Division of Computer and Network Systems [CRI 0855230]
  3. U.S. NIH [R01 AI032475]

Abstract

We consider the problem of learning causal information between random variables in directed acyclic graphs (DAGs) when allowing arbitrarily many latent and selection variables. The FCI (Fast Causal Inference) algorithm has been explicitly designed to infer conditional independence and causal information in such settings. However, FCI is computationally infeasible for large graphs. We therefore propose the new RFCI algorithm, which is much faster than FCI. In some situations the output of RFCI is slightly less informative, in particular, with respect to conditional independence information. However, we prove that any causal information in the output of RFCI is correct in the asymptotic limit. We also define a class of graphs on which the outputs of FCI and RFCI are identical. We prove consistency of FCI and RFCI in sparse high-dimensional settings, and demonstrate in simulations that the estimation performances of the algorithms are very similar. All software is implemented in the R-package pcalg.
