An estimation algorithm using distance clustering of data

An estimation algorithm using distance clustering of data

0.00 Avg rating0 Votes
Article ID: iaor20022590
Country: India
Volume: 38
Issue: 5
Start Page Number: 443
End Page Number: 455
Publication Date: Oct 2001
Journal: OPSEARCH
Authors: ,
Keywords: statistics: general
Abstract:

The problem is to predict a value y ∈ Y (output, class) from an observed value of a vector x ∈ X (predictors, inputs, attributes), the relations between y and x given in (empirical) data D − {(xi,yi) : i − 1,...,N}, listing N observed pairs. We propose an estimation algorithm using a classification of D in clusters {Ω1,....,Ωm}, based on a distance function in X × Y. For each cluster Ωi compute the centroid yi of y, and denote the X-projection of Ωi by Ωix. Prediction of y given x ∈ X is done by assigning the point x to a nearest projected cluster, say Ωix, and using yi as estimate for y. Numerical tests show the method, in its basic general form, to give accurate predictions for well-known data sets.

Reviews

Required fields are marked *. Your email address will not be published.