Article ID: | iaor20051192 |
Country: | United Kingdom |
Volume: | 55 |
Issue: | 9 |
Start Page Number: | 976 |
End Page Number: | 987 |
Publication Date: | Sep 2004 |
Journal: | Journal of the Operational Research Society |
Authors: | Chen Ja-Shen, Ching Russell K.H., Lin Yi-Shen |
Keywords: | marketing |
The K-means algorithm has been a widely applied clustering technique, especially in the area of marketing research. In spite of its popularity and ability to deal with large volumes of data quickly and efficiently, K-means has its drawbacks, such as its inability to provide good solution quality and robustness. In this paper, an extended study of the K-means algorithm is carried out. We propose a new clustering algorithm that integrates the concepts of hierarchical approaches and the K-means algorithm to yield improved performance in terms of solution quality and robustness. This proposed algorithm and score function are introduced and thoroughly discussed. Comparison studies with the K-means algorithm and three popular K-means initialization methods using five well-known test data sets are also presented. Finally, a business application involving segmenting credit card users demonstrates the algorithm's capability.