An extended study of the K-means algorithm for data clustering and its applications

0.00 Avg rating—0 Votes

Article ID:	iaor20051192
Country:	United Kingdom
Volume:	55
Issue:	9
Start Page Number:	976
End Page Number:	987
Publication Date:	Sep 2004
Journal:	Journal of the Operational Research Society
Authors:	Chen Ja-Shen, Ching Russell K.H., Lin Yi-Shen
Keywords:	marketing

Abstract:

The K-means algorithm has been a widely applied clustering technique, especially in the area of marketing research. In spite of its popularity and ability to deal with large volumes of data quickly and efficiently, K-means has its drawbacks, such as its inability to provide good solution quality and robustness. In this paper, an extended study of the K-means algorithm is carried out. We propose a new clustering algorithm that integrates the concepts of hierarchical approaches and the K-means algorithm to yield improved performance in terms of solution quality and robustness. This proposed algorithm and score function are introduced and thoroughly discussed. Comparison studies with the K-means algorithm and three popular K-means initialization methods using five well-known test data sets are also presented. Finally, a business application involving segmenting credit card users demonstrates the algorithm's capability.

Reviews

Required fields are marked *. Your email address will not be published.