4.6 Article

HIGH-DIMENSIONAL ISING MODEL SELECTION USING l1-REGULARIZED LOGISTIC REGRESSION

Journal

ANNALS OF STATISTICS
Volume 38, Issue 3, Pages 1287-1319

Publisher

INST MATHEMATICAL STATISTICS
DOI: 10.1214/09-AOS691

Keywords

Graphical models; Markov random fields; structure learning; l(1)-regularization; model selection; convex risk minimization; high-dimensional asymptotics

Funding

  1. NSF [IIS-0427206, CCF-0625879, DMS-06-05165, CCF-0545862]
  2. Siebel Scholarship

Ask authors/readers for more resources

We consider the problem of estimating the graph associated with a binary Ising Markov random field. We describe a method based on l(1)-regularized logistic regression, in which the neighborhood of any given node is estimated by performing logistic regression subject to an l(1)-constraint. The method is analyzed under high-dimensional scaling in which both the number of nodes p and maximum neighborhood size d are allowed to grow as a function of the number of observations n. Our main results provide sufficient conditions on the triple (n, p, d) and the model parameters for the method to succeed in consistently estimating the neighborhood of every node in the graph simultaneously. With coherence conditions imposed on the population Fisher information matrix, we prove that consistent neighborhood selection can be obtained for sample sizes n = Omega (d(3) log p) with exponentially decaying error. When these same conditions are imposed directly on the sample matrices, we show that a reduced sample size of n = Omega (d(2) log p) suffices for the method to estimate neighborhoods consistently. Although this paper focuses on the binary graphical models, we indicate how a generalization of the method of the paper would apply to general discrete Markov random fields.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.6
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available