4.6 Article

Standardization of continuous and categorical covariates in sparse penalized regressions

Journal

STATISTICAL METHODS IN MEDICAL RESEARCH
Volume 32, Issue 1, Pages 41-54

Publisher

SAGE PUBLICATIONS LTD
DOI: 10.1177/09622802221129042

Keywords

Standardization; dichotomization; variable selection; sparse penalized regression; categorical covariates; continuous covariates

Ask authors/readers for more resources

This paper proposes a novel standardization method for different data types of covariates in sparse penalized regressions, aiming to solve the problem of different selection probabilities caused by traditional methods. The authors illustrate the advantages of the proposed method through simulation studies and empirical analysis.
In sparse penalized regressions, candidate covariates of different units need to be standardized beforehand so that the coefficient sizes are directly comparable and reflect their relative impacts, which leads to fairer variable selection. However, when covariates of mixed data types (e.g. continuous, binary or categorical) exist in the same dataset, the commonly used standardization methods may lead to different selection probabilities even when the covariates have the same impact on or level of association with the outcome. In this paper, we propose a novel standardization method that targets at generating comparable selection probabilities in sparse penalized regressions for continuous, binary or categorical covariates with the same impact. We illustrate the advantages of the proposed method in simulation studies, and apply it to the National Ambulatory Medical Care Survey data to select factors related to the opioid prescription in the US.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.6
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available