☆ 4.4 Article

Variable selection for partially linear models via Bayesian subset modeling with diffusing prior

JOURNAL OF MULTIVARIATE ANALYSIS (2021)

Journal

JOURNAL OF MULTIVARIATE ANALYSIS

Volume 183, Issue -, Pages -

Publisher

ELSEVIER INC

DOI: 10.1016/j.jmva.2021.104733

Keywords

Bayesian variable selection; Difference-based method; Selection consistency; Semiparametric modeling

Funding

NSF, USA [DMS 1820702, DMS 1953196, DMS 2015539]
NIH, USA [R01CA229542, R01 ES019672]

Ask authors/readers for more resources

Protocol

Community support

Reagent

Community support

Automated Summary New
Abstract

A new Bayesian variable selection approach for partially linear models is proposed in this paper, addressing issues of estimation error and multicollinearity while maintaining model selection consistency and outperforming existing methods in highly correlated predictor settings. The method utilizes a one-step procedure, employs the difference-based method to reduce the impact from nonparametric component estimation, and incorporates Bayesian subset modeling with diffusing prior (BSM-DP) to shrink the corresponding estimator in the linear component. Simulation studies support the theory and efficiency of the proposed method, with an application in a study of supermarket data demonstrating its superiority over existing methods.

Most existing methods of variable selection in partially linear models (PLM) with ultrahigh dimensional covariates are based on partial residuals, which involve a two-step estimation procedure. While the estimation error produced in the first step may have an impact on the second step, multicollinearity among predictors adds additional challenges in the model selection procedure. In this paper, we propose a new Bayesian variable selection approach for PLM. This new proposal addresses those two issues simultaneously as (1) it is a one-step method which selects variables in PLM, even when the dimension of covariates increases at an exponential rate with the sample size, and (2) the method retains model selection consistency, and outperforms existing ones in the setting of highly correlated predictors. Distinguished from existing ones, our proposed procedure employs the difference-based method to reduce the impact from the estimation of the nonparametric component, and incorporates Bayesian subset modeling with diffusing prior (BSM-DP) to shrink the corresponding estimator in the linear component. The estimation is implemented by Gibbs sampling, and we prove that the posterior probability of the true model being selected converges to one asymptotically. Simulation studies support the theory and the efficiency of our methods as compared to other existing ones, followed by an application in a study of supermarket data. (C) 2021 Elsevier Inc. All rights reserved.

Variable selection for partially linear models via Bayesian subset modeling with diffusing prior

Journal

JOURNAL OF MULTIVARIATE ANALYSIS

Publisher

ELSEVIER INC

Keywords

Categories

Funding

Ask authors/readers for more resources

Protocol

Reagent

Authors

I am an author on this paper

Reviews

Primary Rating

Secondary Ratings

Novelty

Significance

Scientific rigor

Rate this paper

Recommended

Variable selection for partially linear models via Bayesian subset modeling with diffusing prior

Journal

JOURNAL OF MULTIVARIATE ANALYSIS

Publisher

ELSEVIER INC

Keywords

Categories

Funding

Ask authors/readers for more resources

Protocol

Reagent

Authors

I am an author on this paper

Reviews

Primary Rating

Secondary Ratings

Novelty

Significance

Scientific rigor

Rate this paper

Recommended

Export Citation

Share Paper