4.5 Article

Robust and Distributionally Robust Optimization Models for Linear Support Vector Machine

Journal

COMPUTERS & OPERATIONS RESEARCH
Volume 147, Issue -, Pages -

Publisher

PERGAMON-ELSEVIER SCIENCE LTD
DOI: 10.1016/j.cor.2022.105930

Keywords

Machine Learning; Support Vector Machine; Robust optimization; Distributionally robust optimization

This paper presents novel data-driven optimization models for improving the classification performance of Support Vector Machines (SVM). By introducing uncertainty sets and robust optimization models, more reliable classification can be achieved on noisy real-life data. Experimental results show that the robust classifiers are particularly beneficial for data sets with a small number of observations, and that the distributionally robust model attains higher out-of-sample accuracy as the dimension of the data sets increases.
In this paper, we present novel data-driven optimization models for Support Vector Machines (SVM), with the aim of linearly separating two sets of points that have non-disjoint convex closures. Traditional classification algorithms assume that the training data points are always known exactly. However, real-life data are often subject to noise. To handle such uncertainty, we formulate robust models with uncertainty sets in the form of hyperrectangles or hyperellipsoids, and propose a moment-based distributionally robust optimization model enforcing limits on first-order deviations along principal directions. All the formulations reduce to convex programs. The efficiency of the new classifiers is evaluated on real-world databases. Experiments show that robust classifiers are especially beneficial for data sets with a small number of observations. As the dimension of the data sets increases, feature behavior is gradually learned and higher levels of out-of-sample accuracy can be achieved via the considered distributionally robust optimization method. Overall, compared with deterministic approaches, the proposed formulations allow a trade-off between increasing average accuracy and protecting against uncertainty.
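To make the role of the uncertainty sets concrete, the sketch below computes the worst-case margin of a linear classifier when each training point may move inside a hyperrectangle or a hyperellipsoid. It uses the textbook robust counterpart of the SVM margin constraint, which is not necessarily the exact formulation of the paper. The function names, the radius `rho`, the shape matrix `P`, and the penalty weight `C` are illustrative assumptions.

```python
import numpy as np

def worst_case_margins(X, y, w, b, rho, shape="box", P=None):
    """Smallest signed margin of each point over its uncertainty set.

    Assumed sets (illustrative, not taken from the paper):
      box:       x_i + d with ||d||_inf <= rho
      ellipsoid: x_i + P u with ||u||_2 <= rho
    """
    nominal = y * (X @ w + b)
    if shape == "box":
        # Worst case over an l-infinity ball costs rho * ||w||_1 (dual norm).
        penalty = rho * np.sum(np.abs(w))
    elif shape == "ellipsoid":
        P = np.eye(len(w)) if P is None else P
        # Worst case over the ellipsoid costs rho * ||P^T w||_2.
        penalty = rho * np.linalg.norm(P.T @ w)
    else:
        raise ValueError("shape must be 'box' or 'ellipsoid'")
    return nominal - penalty

def robust_hinge_loss(X, y, w, b, rho, C=1.0, shape="box", P=None):
    """Regularized hinge loss evaluated on the worst-case margins."""
    margins = worst_case_margins(X, y, w, b, rho, shape, P)
    return 0.5 * (w @ w) + C * np.sum(np.maximum(0.0, 1.0 - margins))
```

Because both penalties are norms of `w`, the resulting loss stays convex in `(w, b)`, in line with the statement that the formulations reduce to convex programs. Setting `rho = 0` recovers the usual deterministic soft-margin objective.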
