Key points are not available for this paper at this time.
In this paper we present novel data-driven optimization models for Support Vector Machines (SVM), with the aim of linearly separating two sets of points that have non-disjoint convex closures. Traditional classification algorithms assume that the training data points are always known exactly. However, real-life data are often subject to noise. To handle such uncertainty, we formulate robust models with uncertainty sets in the form of hyperrectangles or hyperellipsoids, and propose a moment-based distributionally robust optimization model enforcing limits on first-order deviations along principal directions. All the formulations reduce to convex programs. The efficiency of the new classifiers is evaluated on real-world databases. Experiments show that robust classifiers are especially beneficial for data sets with a small number of observations. As the dimension of the data sets increases, features behavior is gradually learned and higher levels of out-of-sample accuracy can be achieved via the considered distributionally robust optimization method. The proposed formulations, overall, allow finding a trade-off between increasing the average performance accuracy and protecting against uncertainty, with respect to deterministic approaches.
Building similarity graph...
Analyzing shared references across papers
Loading...
Daniel Faccini
University of Bergamo
Francesca Maggioni
University of Bergamo
Florian A. Potra
University of Maryland, Baltimore County
Computers & Operations Research
University of Maryland, Baltimore County
University of Bergamo
Building similarity graph...
Analyzing shared references across papers
Loading...
Faccini et al. (Tue,) studied this question.
synapsesocial.com/papers/6a1d925343708a372d5e6160 — DOI: https://doi.org/10.1016/j.cor.2022.105930