4.6 Article

Fast and accurate variational inference for models with many latent variables

Journal

JOURNAL OF ECONOMETRICS
Volume 230, Issue 2, Pages 339-362

Publisher

ELSEVIER SCIENCE SA
DOI: 10.1016/j.jeconom.2021.05.002

Keywords

Latent variable models; Time-varying VAR with stochastic volatility; Large consumer panels; Sub-sampling variational inference; Stochastic gradient ascent

Ask authors/readers for more resources

Models with a large number of latent variables can be difficult to estimate, but variational inference methods provide an attractive solution. This study proposes a tractable variational approximation that combines a parsimonious approximation for the parameter posterior with the exact conditional posterior of the latent variables. The method is more accurate and faster to calibrate, and allows for the implementation of data sub-sampling in variational inference.
Models with a large number of latent variables are often used to utilize the information in big or complex data, but can be difficult to estimate. Variational inference methods provide an attractive solution. These methods use an approximation to the posterior density, yet for large latent variable models existing choices can be inaccurate or slow to calibrate. Here, we propose a family of tractable variational approximations that are more accurate and faster to calibrate for this case. It combines a parsimonious approximation for the parameter posterior with the exact conditional posterior of the latent variables. We derive a simplified expression for the re-parameterization gradient of the variational lower bound, which is the main ingredient of optimization algorithms used for calibration. Implementation only requires exact or approximate generation from the conditional posterior of the latent variables, rather than computation of their density. In effect, our method provides a new way to employ Markov chain Monte Carlo (MCMC) within variational inference. We illustrate using two complex contemporary econometric examples. The first is a nonlinear multivariate state space model for U.S. macroeconomic variables. The second is a random coefficients tobit model applied to two million sales by 20,000 individuals in a consumer panel. In both cases, our approximating family is considerably more accurate than mean field or structured Gaussian approximations, and faster than MCMC. Last, we show how to implement data sub-sampling in variational inference for our approximation, further reducing computation time. MATLAB code implementing the method is provided. (C) 2021 Elsevier B.V. All rights reserved.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.6
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available