4.2 Article

Variable Selection With Second-Generation P-Values

期刊

AMERICAN STATISTICIAN
卷 76, 期 2, 页码 91-101

出版社

TAYLOR & FRANCIS INC
DOI: 10.1080/00031305.2021.1946150

关键词

Lasso; Penalized regression; Second-generation p-values; Variable selection

向作者/读者索取更多资源

ProSGPV is a novel variable selection approach that strikes a good balance between inference and prediction tasks, by using second-generation p-values and l(0) penalization scheme to determine variables, achieving better performance than traditional methods.
Many statistical methods have been proposed for variable selection in the past century, but few balance inference and prediction tasks well. Here, we report on a novel variable selection approach called penalized regression with second-generation p-values (ProSGPV). It captures the true model at the best rate achieved by current standards, is easy to implement in practice, and often yields the smallest parameter estimation error. The idea is to use an l(0) penalization scheme with second-generation p-values (SGPV), instead of traditional ones, to determine which variables remain in a model. The approach yields tangible advantages for balancing support recovery, parameter estimation, and prediction tasks. The ProSGPV algorithm can maintain its good performance even when there is strong collinearity among features or when a high-dimensional feature space with p > n is considered. We present extensive simulations and a real-world application comparing the ProSGPV approach with smoothly clipped absolute deviation (SCAD), adaptive lasso (AL), and minimax concave penalty with penalized linear unbiased selection (MC+). While the last three algorithms are among the current standards for variable selection, ProSGPV has superior inference performance and comparable prediction performance in certain scenarios.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.2
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据