4.3 Article

TWO PENALIZED MIXED-INTEGER NONLINEAR PROGRAMMING APPROACHES TO TACKLE MULTICOLLINEARITY AND OUTLIERS EFFECTS IN LINEAR REGRESSION MODELS

Journal

Publisher

AMER INST MATHEMATICAL SCIENCES-AIMS
DOI: 10.3934/jimo.2020128

Keywords

Regression analysis; multicollinearity; breakdown point; mixed-integer programming; metaheuristic algorithm

Funding

  1. Research Council of Semnan University

Ordinary least-squares estimation is the best strategy in classical regression analysis when its assumptions hold, but it may give misleading results when they are violated. Robust estimators are widely used to handle outliers, while multicollinearity is another common problem that degrades least-squares estimators. Appropriate estimation methods are needed to address both issues.
In classical regression analysis, ordinary least-squares estimation is the best strategy when the essential assumptions are met: normality and independence of the error terms, and negligible multicollinearity among the covariates. However, if one of these assumptions is violated, the results may be misleading. In particular, outliers violate the assumption of normally distributed residuals in least-squares regression. In this situation, robust estimators are widely used because they are insensitive to outlying data points. Multicollinearity is another common problem in multiple regression models, with adverse effects on the least-squares estimators. It is therefore important to use estimation methods designed to tackle these problems. Robust regressions are among the popular methods for analyzing data contaminated with outliers. Along these lines, we propose two mixed-integer nonlinear optimization models whose solutions can be considered appropriate estimators when outliers and multicollinearity appear simultaneously in the data set. The models can be solved effectively by metaheuristic algorithms and are designed around penalization schemes capable of down-weighting or ignoring unusual data and multicollinearity effects. We establish that our models are computationally advantageous in terms of flop count. We also address a robust ridge methodology. Finally, three real data sets are analyzed to examine the performance of the proposed methods.
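
To make the idea of combining outlier down-weighting with a ridge-type penalty concrete, the following is a minimal sketch of a generic robust ridge estimator: Huber-weighted iteratively reweighted least squares with an L2 penalty. It is not the mixed-integer nonlinear models proposed in the paper, which the abstract does not specify. The function name `robust_ridge`, the penalty `lam`, the Huber constant `c`, and the synthetic data are all assumptions chosen for illustration, and the predictors are assumed to be centered so that no intercept is fitted.

```python
import numpy as np

def robust_ridge(X, y, lam=1.0, c=1.345, n_iter=50, tol=1e-8):
    """Huber-weighted ridge regression fitted by IRLS (illustrative only)."""
    n, p = X.shape
    # Start from the ordinary (non-robust) ridge solution.
    beta = np.linalg.solve(X.T @ X + lam * np.eye(p), X.T @ y)
    w = np.ones(n)
    for _ in range(n_iter):
        r = y - X @ beta
        # Robust residual scale from the median absolute deviation.
        scale = max(np.median(np.abs(r - np.median(r))) / 0.6745, 1e-12)
        u = np.abs(r) / scale
        # Huber weights: 1 for small residuals, c/u for large ones.
        w = np.where(u <= c, 1.0, c / np.maximum(u, 1e-12))
        XtW = X.T * w  # scales the i-th observation by w[i]
        beta_new = np.linalg.solve(XtW @ X + lam * np.eye(p), XtW @ y)
        converged = np.linalg.norm(beta_new - beta) < tol
        beta = beta_new
        if converged:
            break
    return beta, w

# Hypothetical data: two nearly collinear predictors and five gross outliers.
rng = np.random.default_rng(0)
n = 100
x1 = rng.normal(size=n)
x2 = x1 + 0.01 * rng.normal(size=n)   # almost a copy of x1
X = np.column_stack([x1, x2])
y = X @ np.array([1.0, 1.0]) + 0.1 * rng.normal(size=n)
y[:5] += 20.0                          # contaminate the first five responses

beta, w = robust_ridge(X, y, lam=1.0)
print("coefficients:", beta)
print("weights of contaminated points:", w[:5])
```

In this sketch the L2 penalty stabilizes the coefficients along the nearly collinear direction, while the Huber weights shrink the influence of the contaminated observations. A hard 0/1 inclusion variable per observation, as a mixed-integer formulation would use, replaces the continuous weights with binary decisions.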
