☆ 4.3 Article

A BOOTSTRAP LASSO plus PARTIAL RIDGE METHOD TO CONSTRUCT CONFIDENCE INTERVALS FOR PARAMETERS IN HIGH-DIMENSIONAL SPARSE LINEAR MODELS

STATISTICA SINICA (2020)

Journal

STATISTICA SINICA

Volume 30, Issue 3, Pages 1333-1355

Publisher

STATISTICA SINICA

DOI: 10.5705/ss.202018.0131

Keywords

Bootstrap; confidence interval; high-dimensional inference; Lasso plus partial ridge; model selection consistency

Funding

NSF [DMS-1613002, DMS-1228246, DMS-1613338, DBI-1846216]
AFOSR [FA955014-1-0016]
National Natural Science Foundation of China [11701316]
Hellman Fellowship
PhRMA Foundation Research Starter Grant in Informatics
Sloan Research Fellowship
Johnson & Johnson WiSTEM2D Award
NIH/NIGMS [R01GM120507]

Ask authors/readers for more resources

Protocol

Community support

Reagent

Community support

Abstract

Constructing confidence intervals for the coefficients of high-dimensional sparse linear models remains a challenge, mainly because of the complicated limiting distributions of the widely used estimators, such as the lasso. Several methods have been developed for constructing such intervals. Bootstrap lasso+ols is notable for its technical simplicity, good interpretability, and performance that is comparable with that of other more complicated methods. However, bootstrap lasso+ols depends on the beta-min assumption, a theoretic criterion that is often violated in practice. Thus, we introduce a new method, called bootstrap lasso+partial ridge, to relax this assumption. Lasso+partial ridge is a two-stage estimator. First, the lasso is used to select features. Then, the partial ridge is used to refit the coefficients. Simulation results show that bootstrap lasso+partial ridge outperforms bootstrap lasso+ols when there exist small, but nonzero coefficients, a common situation that violates the beta-min assumption. For such coefficients, the confidence intervals constructed using bootstrap lasso+partial ridge have, on average, 50% larger coverage probabilities than those of bootstrap lasso+ols. Bootstrap lasso+partial ridge also has, on average, 35% shorter confidence interval lengths than those of the desparsified lasso methods, regardless of whether the linear models are misspecified. Additionally, we provide theoretical guarantees for bootstrap lasso+partial ridge under appropriate conditions, and implement it in the R package HDCI.

A BOOTSTRAP LASSO plus PARTIAL RIDGE METHOD TO CONSTRUCT CONFIDENCE INTERVALS FOR PARAMETERS IN HIGH-DIMENSIONAL SPARSE LINEAR MODELS

Journal

STATISTICA SINICA

Publisher

STATISTICA SINICA

Keywords

Categories

Funding

Ask authors/readers for more resources

Protocol

Reagent

Authors

I am an author on this paper

Reviews

Primary Rating

Secondary Ratings

Novelty

Significance

Scientific rigor

Rate this paper

Recommended

A BOOTSTRAP LASSO plus PARTIAL RIDGE METHOD TO CONSTRUCT CONFIDENCE INTERVALS FOR PARAMETERS IN HIGH-DIMENSIONAL SPARSE LINEAR MODELS

Journal

STATISTICA SINICA

Publisher

STATISTICA SINICA

Keywords

Categories

Funding

Ask authors/readers for more resources

Protocol

Reagent

Authors

I am an author on this paper

Reviews

Primary Rating

Secondary Ratings

Novelty

Significance

Scientific rigor

Rate this paper

Recommended

Export Citation

Share Paper