An efficient algorithm for maximal margin clustering

An efficient algorithm for maximal margin clustering

0.00 Avg rating0 Votes
Article ID: iaor201113488
Volume: 52
Issue: 1
Start Page Number: 123
End Page Number: 137
Publication Date: Jan 2012
Journal: Journal of Global Optimization
Authors: , , , ,
Keywords: complexity, clustering
Abstract:

Maximal margin based frameworks have emerged as a powerful tool for supervised learning. The extension of these ideas to the unsupervised case, however, is problematic since the underlying optimization entails a discrete component. In this paper, we first study the computational complexity of maximal hard margin clustering and show that the hard margin clustering problem can be precisely solved in O(n d+2) time where n is the number of the data points and d is the dimensionality of the input data. However, since it is well known that many datasets commonly ‘express’ themselves primarily in far fewer dimensions, our interest is in evaluating if a careful use of dimensionality reduction can lead to practical and effective algorithms. We build upon these observations and propose a new algorithm that gradually increases the number of features used in the separation model in each iteration, and analyze the convergence properties of this scheme. We report on promising numerical experiments based on a ‘truncated’ version of this approach. Our experiments indicate that for a variety of datasets, good solutions equivalent to those from other existing techniques can be obtained in significantly less time.

Reviews

Required fields are marked *. Your email address will not be published.