4.2 Article

Robust sure independence screening for nonpolynomial dimensional generalized linear models

Journal

SCANDINAVIAN JOURNAL OF STATISTICS
Volume 50, Issue 3, Pages 1232-1262

Publisher

WILEY
DOI: 10.1111/sjos.12628

Keywords

conditional screening; density power divergence; DPD-SIS; high-dimensional statistics; robustness; sure independence screening

Ask authors/readers for more resources

In this paper, we discuss a new robust screening procedure based on MDPDE for variable screening in ultra-high-dimensional GLMs. Our proposed method performs well under pure and contaminated data scenarios. The theoretical motivation and proof for the use of marginal MDPDEs, as well as the derivation of a reliable conditional screening method for GLMs, are also provided.
We consider the problem of variable screening in ultra-high-dimensional generalized linear models (GLMs) of nonpolynomial orders. Since the popular SIS approach is extremely unstable in the presence of contamination and noise, we discuss a new robust screening procedure based on the minimum density power divergence estimator (MDPDE) of the marginal regression coefficients. Our proposed screening procedure performs well under pure and contaminated data scenarios. We provide a theoretical motivation for the use of marginal MDPDEs for variable screening from both population as well as sample aspects; in particular, we prove that the marginal MDPDEs are uniformly consistent leading to the sure screening property of our proposed algorithm. Finally, we propose an appropriate MDPDE-based extension for robust conditional screening in GLMs along with the derivation of its sure screening property. Our proposed methods are illustrated through extensive numerical studies along with an interesting real data application.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.2
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available