4.4 Article

A general algorithm for covariance modeling of discrete data

Journal

JOURNAL OF MULTIVARIATE ANALYSIS
Volume 165, Issue -, Pages 86-100

Publisher

ELSEVIER INC
DOI: 10.1016/j.jmva.2017.12.002

Keywords

Factor analysis; Gaussian copula; Graphical model; Overdispersed count data; Species interaction

Funding

  1. University of New South Wales
  2. Australian Research Council [DP130102131, FT120100501]
  3. Australian Research Council [FT120100501] Funding Source: Australian Research Council

Ask authors/readers for more resources

We propose an algorithm that generalizes to discrete data any given covariance modeling algorithm originally intended for Gaussian responses, via a Gaussian copula approach. Covariance modeling is a powerful tool for extracting meaning from multivariate data, and fast algorithms for Gaussian data, such as factor analysis and Gaussian graphical models, are widely available. Our algorithm makes these tools generally available to analysts of discrete data and can combine any likelihood-based covariance modeling method for Gaussian data with any set of discrete marginal distributions. Previously, tools for discrete data were generally specific to one family of distributions or covariance modeling paradigm, or otherwise did not exist. Our algorithm is more flexible than alternate methods, takes advantage of existing fast algorithms for Gaussian data, and simulations suggest that it outperforms competing graphical modeling and factor analysis procedures for count and binomial data. We additionally show that in a Gaussian copula graphical model with discrete margins, conditional independence relationships in the latent Gaussian variables are inherited by the discrete observations. Our method is illustrated with a graphical model and factor analysis on an overdispersed ecological count dataset of species abundances. (C) 2017 Elsevier Inc. All rights reserved.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.4
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available