Article ID: | iaor20122468 |
Volume: | 13 |
Issue: | 3 |
Start Page Number: | 239 |
End Page Number: | 255 |
Publication Date: | Feb 2012 |
Journal: | International Journal of Operational Research |
Authors: | Weber Gerhard-Wilhelm, Yahalom Orly, Volkovich Zeev, Avros Renata |
Keywords: | data mining |
This work addresses the cluster validation problem of determining the 'right' number of clusters. We consider a cluster stability property based on the k‐nearest neighbour type coincidences model. The quality of a clustering is measured by its deviation from this model, where a small deviation indicates a good clustering. The true number of clusters is taken to be the one whose empirical deviation distribution has the shortest right tail. Experiments carried out on synthetic and real data sets demonstrate the effectiveness of our method.
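
The abstract does not spell out the coincidences model or the deviation measure, so the following Python sketch is only one plausible reading, not the authors' procedure. It assumes that two equal-sized samples are pooled and clustered, that a "coincidence" is a nearest neighbour drawn from the same sample as the point itself, and that under a good clustering the coincidence count among K neighbours is roughly Binomial(K, 1/2). The function names `coincidence_counts` and `deviation_from_binomial`, and the use of total variation distance as the deviation, are hypothetical choices made for illustration.

```python
from math import comb

import numpy as np


def coincidence_counts(points, labels, sample_ids, n_neighbors):
    """Count, for each point, how many of its n_neighbors nearest
    neighbours inside its own cluster come from the same sample.

    Points in clusters with at most n_neighbors members get -1.
    """
    points = np.asarray(points, dtype=float)
    labels = np.asarray(labels)
    sample_ids = np.asarray(sample_ids)
    counts = np.full(len(points), -1, dtype=int)
    for c in np.unique(labels):
        idx = np.flatnonzero(labels == c)
        if len(idx) <= n_neighbors:
            continue  # too few points to have n_neighbors neighbours
        p = points[idx]
        # Pairwise Euclidean distances within the cluster.
        dist = np.sqrt(((p[:, None, :] - p[None, :, :]) ** 2).sum(axis=2))
        np.fill_diagonal(dist, np.inf)  # a point is not its own neighbour
        nn = np.argsort(dist, axis=1)[:, :n_neighbors]
        s = sample_ids[idx]
        counts[idx] = (s[nn] == s[:, None]).sum(axis=1)
    return counts


def deviation_from_binomial(counts, n_neighbors):
    """Total variation distance between the empirical distribution of
    coincidence counts and Binomial(n_neighbors, 1/2).

    Assumption: this distance stands in for the paper's deviation measure.
    """
    counts = np.asarray(counts)
    counts = counts[counts >= 0]  # drop points skipped above
    if len(counts) == 0:
        return float("nan")
    empirical = np.bincount(counts, minlength=n_neighbors + 1) / len(counts)
    model = np.array(
        [comb(n_neighbors, j) for j in range(n_neighbors + 1)]
    ) / 2.0 ** n_neighbors
    return 0.5 * float(np.abs(empirical - model).sum())
```

Under this reading, one would repeat the sampling and clustering many times for each candidate number of clusters, collect the resulting deviations, and prefer the candidate whose deviations have the shortest right tail, for instance the smallest upper quantile.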