4.1 Article

Penalized robust estimators in sparse logistic regression

Journal

TEST
Volume 31, Issue 3, Pages 563-594

Publisher

SPRINGER
DOI: 10.1007/s11749-021-00792-w

Keywords

Logistic regression; Outliers; Penalty functions; Robust estimation; Sparse models

Funding

  1. Universidad de Buenos Aires [20020170100022BA]
  2. ANPCYT at Buenos Aires, Argentina [PICT 2018-00740]
  3. Ministry of Economy, Industry and Competitiveness (MINECO/AEI/FEDER, UE) [MTM2016-76969P]

Ask authors/readers for more resources

This paper focuses on robust and penalized estimation for logistic regression parameters, introducing a class of stable penalized estimators for handling atypical data. The study includes analysis of convergence rates, variable selection capability, and asymptotic distribution of estimators under different penalties, as well as comparison of classical and robust estimators' performance in the presence of outliers through numerical simulations.
Sparse covariates are frequent in classification and regression problems where the task of variable selection is usually of interest. As it is well known, sparse statistical models correspond to situations where there are only a small number of nonzero parameters, and for that reason, they are much easier to interpret than dense ones. In this paper, we focus on the logistic regression model and our aim is to address robust and penalized estimation for the regression parameter. We introduce a family of penalized weighted M-type estimators for the logistic regression parameter that are stable against atypical data. We explore different penalization functions including the so-called Sign penalty. We provide a careful analysis of the estimators convergence rates as well as their variable selection capability and asymptotic distribution for fixed and random penalties. A robust cross-validation criterion is also proposed. Through a numerical study, we compare the finite sample performance of the classical and robust penalized estimators, under different contamination scenarios. The analysis of real datasets enables to investigate the stability of the penalized estimators in the presence of outliers.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.1
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available