July 5, 2022Open Access

Robust and Distributionally Robust Optimization Models for Linear Support Vector Machine

Key Points

Key points are not available for this paper at this time.

Abstract

In this paper we present novel data-driven optimization models for Support Vector Machines (SVM), with the aim of linearly separating two sets of points that have non-disjoint convex closures. Traditional classification algorithms assume that the training data points are always known exactly. However, real-life data are often subject to noise. To handle such uncertainty, we formulate robust models with uncertainty sets in the form of hyperrectangles or hyperellipsoids, and propose a moment-based distributionally robust optimization model enforcing limits on first-order deviations along principal directions. All the formulations reduce to convex programs. The efficiency of the new classifiers is evaluated on real-world databases. Experiments show that robust classifiers are especially beneficial for data sets with a small number of observations. As the dimension of the data sets increases, features behavior is gradually learned and higher levels of out-of-sample accuracy can be achieved via the considered distributionally robust optimization method. The proposed formulations, overall, allow finding a trade-off between increasing the average performance accuracy and protecting against uncertainty, with respect to deterministic approaches.

Read Full Paperexternally

Bookmark

View Full Paper

Cite This Study

Faccini et al. (Tue,) studied this question.

synapsesocial.com/papers/6a1d925343708a372d5e6160 https://doi.org/https://doi.org/10.1016/j.cor.2022.105930

Bookmark

View Full Paper