4.6 Article

Bayesian Gaussian Copula Factor Models for Mixed Data

期刊

JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION
卷 108, 期 502, 页码 656-665

出版社

TAYLOR & FRANCIS INC
DOI: 10.1080/01621459.2012.762328

关键词

Extended rank likelihood; Factor analysis; High dimensional; Latent variables; Parameter expansion; Semiparametric

资金

  1. Measurement to Understand Re-Classification of Disease of Cabarrus and Kannapolis (MURDOCK) Study
  2. NIH (National Institutes of Health) CTSA (Clinical and Translational Science Award) [1UL1RR024128-01]
  3. NIH [R01 ES017436]

向作者/读者索取更多资源

Gaussian factor models have proven widely useful for parsimoniously characterizing dependence in multivariate data. There is rich literature on their extension to mixed categorical and continuous variables, using latent Gaussian variables or through generalized latent trait models accommodating measurements in the exponential family. However, when generalizing to non-Gaussian measured variables, the latent variables typically influence both the dependence structure and the form of the marginal distributions, complicating interpretation and introducing artifacts. To address this problem, we propose a novel class of Bayesian Gaussian copula factor models that decouple the latent factors from the marginal distributions. A semiparametric specification for the marginals based on the extended rank likelihood yields straightforward implementation and substantial computational gains. We provide new theoretical and empirical justifications for using this likelihood in Bayesian inference. We propose new default priors for the factor loadings and develop efficient parameter-expanded Gibbs sampling for posterior computation. The methods are evaluated through simulations and applied to a dataset in political science. The models in this article are implemented in the R package bfa (available from http://stat.duke.edu/jsm38/software/bfa). Supplementary materials for this article are available online.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.6
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据