4.4 Article

Using cross-validation methods to select time series models: Promises and pitfalls

出版社

WILEY
DOI: 10.1111/bmsp.12330

关键词

autoregressive model; cross-validation; information criteria; time series; vector autoregressive model

向作者/读者索取更多资源

Vector autoregressive (VAR) modelling is widely used in psychology for time series analysis. However, short time series in psychological studies can lead to overfitting and impair predictive ability. This simulation study found that cross-validation methods, especially blocked CV, are effective in estimating prediction errors and outperform traditional methods like AIC and BIC. CV methods tend to underestimate prediction errors of simpler models, but overestimate prediction errors of VAR models, especially with small sample sizes.
Vector autoregressive (VAR) modelling is widely employed in psychology for time series analyses of dynamic processes. However, the typically short time series in psychological studies can lead to overfitting of VAR models, impairing their predictive ability on unseen samples. Cross-validation (CV) methods are commonly recommended for assessing the predictive ability of statistical models. However, it is unclear how the performance of CV is affected by characteristics of time series data and the fitted models. In this simulation study, we examine the ability of two CV methods, namely,10-fold CV and blocked CV, in estimating the prediction errors of three time series models with increasing complexity (person-mean, AR, and VAR), and evaluate how their performance is affected by data characteristics. We then compare these CV methods to the traditional methods using the Akaike (AIC) and Bayesian (BIC) information criteria in their accuracy of selecting the most predictive models. We find that CV methods tend to underestimate prediction errors of simpler models, but overestimate prediction errors of VAR models, particularly when the number of observations is small. Nonetheless, CV methods, especially blocked CV, generally outperform the AIC and BIC. We conclude our study with a discussion on the implications of the findings and provide helpful guidelines for practice.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.4
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据