TY - GEN

T1 - A support vector method for clustering

AU - Ben-Hur, Asa

AU - Horn, David

AU - Siegelmann, Hava T.

AU - Vapnik, Vladimir

PY - 2001

Y1 - 2001

N2 - We present a novel method for clustering using the support vector machine approach. Data points are mapped to a high dimensional feature space, where support vectors are used to define a sphere enclosing them. The boundary of the sphere forms in data space a set of closed contours containing the data. Data points enclosed by each contour are defined as a cluster. As the width parameter of the Gaussian kernel is decreased, these contours fit the data more tightly and splitting of contours occurs. The algorithm works by separating clusters according to valleys in the underlying probability distribution, and thus clusters can take on arbitrary geometrical shapes. As in other SV algorithms, outliers can be dealt with by introducing a soft margin constant leading to smoother cluster boundaries. The structure of the data is explored by varying the two parameters. We investigate the dependence of our method on these parameters and apply it to several data sets.

AB - We present a novel method for clustering using the support vector machine approach. Data points are mapped to a high dimensional feature space, where support vectors are used to define a sphere enclosing them. The boundary of the sphere forms in data space a set of closed contours containing the data. Data points enclosed by each contour are defined as a cluster. As the width parameter of the Gaussian kernel is decreased, these contours fit the data more tightly and splitting of contours occurs. The algorithm works by separating clusters according to valleys in the underlying probability distribution, and thus clusters can take on arbitrary geometrical shapes. As in other SV algorithms, outliers can be dealt with by introducing a soft margin constant leading to smoother cluster boundaries. The structure of the data is explored by varying the two parameters. We investigate the dependence of our method on these parameters and apply it to several data sets.

UR - http://www.scopus.com/inward/record.url?scp=84899014502&partnerID=8YFLogxK

M3 - ???researchoutput.researchoutputtypes.contributiontobookanthology.conference???

AN - SCOPUS:84899014502

SN - 0262122413

SN - 9780262122412

T3 - Advances in Neural Information Processing Systems

BT - Advances in Neural Information Processing Systems 13 - Proceedings of the 2000 Conference, NIPS 2000

PB - Neural information processing systems foundation

T2 - 14th Annual Neural Information Processing Systems Conference, NIPS 2000

Y2 - 27 November 2000 through 2 December 2000

ER -