Strongly Agree or Strongly Disagree?: Rating Features in Support Vector Machines

Carrizosa, Emilio; Nogales-Gómez, Amaya; Romero Morales, Dolores; Morales, Dolores Romero

Published in

Elsevier, Information Sciences, (329), p. 256-273

DOI: 10.1016/j.ins.2015.09.031

Tools

Export citation

Search in Google Scholar

Strongly Agree or Strongly Disagree?: Rating Features in Support Vector Machines

Journal article published in 2015 by Emilio Carrizosa

, Amaya Nogales-Gómez, Dolores Romero Morales

, Dolores Romero Morales

This paper is available in a repository.

Full text: Download

Preprint: archiving allowed

Upload

Postprint: archiving forbidden

Published version: archiving forbidden

Policy details

Data provided by

Abstract

In linear classifiers, such as the Support Vector Machine (SVM), a score is associated with each feature and objects are assigned to classes based on the linear combination of the scores and the values of the features. Inspired by discrete psychometric scales, which measure the extent to which a factor is in agreement with a statement, we propose the Discrete Level Support Vector Machine (DILSVM) where the feature scores can only take on a discrete number of values, defined by the so-called feature rating levels. The DILSVM classifier benefits from interpretability and it has visual appeal, since it can be represented as a collection of Likert scales, one for each feature, where we rate the level of agreement with the positive class. To construct the DILSVM classifier, we propose a Mixed Integer Linear Programming approach, as well as a collection of strategies to reduce computational cost. Our numerical experiments show that the 3-point and the 5-point DILSVM classifiers have comparable accuracy to the SVM with a substantial gain in interpretability and visual appeal, but also in sparsity, thanks to the appropriate choice of the feature rating levels.

Published in

Links

Tools

Strongly Agree or Strongly Disagree?: Rating Features in Support Vector Machines

Abstract