4.5 Article

Prediction-Oriented Model Selection in Partial Least Squares Path Modeling

Journal

DECISION SCIENCES
Volume 52, Issue 3, Pages 567-607

Publisher

WILEY
DOI: 10.1111/deci.12329

Keywords

Model Selection; Model Selection Criteria; Monte Carlo Simulation; Partial Least Squares Path Modeling (PLS-PM); Prediction

Categories

Ask authors/readers for more resources

The study compares the performance of standard PLS-PM criteria and Information Theory-derived model selection criteria, finding that in-sample criteria can serve as useful substitutes for out-of-sample criteria when there is no holdout sample. The best performing out-of-sample criteria include RMSE and MAD when a holdout sample is available.
Partial least squares path modeling (PLS-PM) has become popular in various disciplines to model structural relationships among latent variables measured by manifest variables. To fully benefit from the predictive capabilities of PLS-PM, researchers must understand the efficacy of predictive metrics used. In this research, we compare the performance of standard PLS-PM criteria and model selection criteria derived from Information Theory, in terms of selecting the best predictive model among a cohort of competing models. We use Monte Carlo simulation to study this question under various sample sizes, effect sizes, item loadings, and model setups. Specifically, we explore whether, and when, the in-sample measures such as the model selection criteria can substitute for out-of-sample criteria that require a holdout sample. Such a substitution is advantageous when creating a holdout causes considerable loss of statistical and predictive power due to an overall small sample. We find that when the researcher does not have the luxury of a holdout sample, and the goal is selecting correctly specified models with low prediction error, the in-sample model selection criteria, in particular the Bayesian Information Criterion (BIC) and Geweke-Meese Criterion (GM), are useful substitutes for out-of-sample criteria. When a holdout sample is available, the best performing out-of-sample criteria include the root mean squared error (RMSE) and mean absolute deviation (MAD). We recommend against using standard the PLS-PM criteria (R-2, Adjusted R-2, and Q(2)), and specifically the out-of-sample mean absolute percentage error (MAPE) for prediction-oriented model selection purposes. Finally, we illustrate the model selection criteria's practical utility using a well-known corporate reputation model.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.5
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available