4.5 Article

Improved high-dimensional regression models with matrix approximations applied to the comparative case studies with support vector machines

Journal

OPTIMIZATION METHODS & SOFTWARE
Volume 37, Issue 5, Pages 1912-1929

Publisher

TAYLOR & FRANCIS LTD
DOI: 10.1080/10556788.2021.2022144

Keywords

Robustness and sensitivity analysis; regression analysis; penalized method; rank-one update; diagonal approximation; metaheuristic algorithm

Ask authors/readers for more resources

In regression analysis, the challenges of high-dimensional data include issues such as the rank of the design matrix, outliers, and ill-conditioning. Penalties can effectively address these challenges, with penalized mixed-integer nonlinear programming models showing promising results in regression analysis for high-dimensional data.
Nowadays, high-dimensional data appear in many practical applications such as biosciences. In the regression analysis literature, the well-known ordinary least-squares estimation may be misleading when the full ranking of the design matrix is missed. As a popular issue, outliers may corrupt normal distribution of the residuals. Thus, since not being sensitive to the outlying data points, robust estimators are frequently applied in confrontation with the issue. Ill-conditioning in high-dimensional data is another common problem in modern regression analysis under which applying the least-squares estimator is hardly possible. So, it is necessary to deal with estimation methods to tackle these problems. As known, a successful approach for high-dimension cases is the penalized scheme with the aim of obtaining a subset of effective explanatory variables that predict the response as the best, while setting the other parameters to zero. Here, we develop several penalized mixed-integer nonlinear programming models to be used in high-dimension regression analysis. The given matrix approximations have simple structures, decreasing computational cost of the models. Moreover, the models are effectively solvable by metaheuristic algorithms. Numerical tests are made to shed light on performance of the proposed methods on simulated and real world high-dimensional data sets.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.5
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available