Article ID: | iaor20041896 |
Country: | United Kingdom |
Volume: | 54 |
Issue: | 7 |
Start Page Number: | 790 |
End Page Number: | 797 |
Publication Date: | Jul 2003 |
Journal: | Journal of the Operational Research Society |
Authors: | Osei-Bryson K.-M., Giles K., Kositanurit B. |
Keywords: | classification |
In the knowledge discovery process, classification algorithms are often used to build models from training data that can then predict the classes of previously unseen data instances. Several factors can influence classification results, such as the node-splitting measure used to build decision trees. With large data sets, feature selection is often applied as a pre-classification step to eliminate irrelevant or redundant attributes, in order to increase computational efficiency and possibly classification accuracy. One important factor common to both feature selection and classification using decision trees is attribute discretization, the process of dividing attribute values into a smaller number of discrete values. In this paper, we present and explore a new hybrid approach, ChiBlur, which combines concepts from the blurring and χ²-based approaches to feature selection with concepts from multi-objective optimization. We compare this new algorithm with algorithms based on the blurring and χ²-based approaches.
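As a rough illustration of the two ideas the abstract relies on, the sketch below discretizes a numeric attribute and then scores it against the class labels with a Pearson χ² statistic. This is not ChiBlur, and it is not taken from the paper: the equal-width binning rule, the function names `equal_width_bins` and `chi_square`, and the toy data are all assumptions chosen only to make the concepts concrete.

```python
from collections import Counter


def equal_width_bins(values, n_bins):
    """Map each numeric value to a bin index in 0..n_bins-1 (equal-width intervals).

    Equal-width binning is an assumed, illustrative discretization rule.
    """
    lo, hi = min(values), max(values)
    if hi == lo:
        return [0 for _ in values]
    width = (hi - lo) / n_bins
    # min() keeps the maximum value inside the last bin
    return [min(int((v - lo) / width), n_bins - 1) for v in values]


def chi_square(bins, labels):
    """Pearson chi-square statistic for the bin-by-class contingency table."""
    n = len(bins)
    bin_totals = Counter(bins)
    class_totals = Counter(labels)
    observed = Counter(zip(bins, labels))
    stat = 0.0
    for b, b_count in bin_totals.items():
        for c, c_count in class_totals.items():
            expected = b_count * c_count / n
            stat += (observed[(b, c)] - expected) ** 2 / expected
    return stat


# Hypothetical toy data: one numeric attribute and a binary class label.
ages = [22, 25, 31, 38, 45, 52, 58, 63]
classes = ["no", "no", "no", "no", "yes", "yes", "yes", "yes"]

bins = equal_width_bins(ages, 2)
score = chi_square(bins, classes)
print(bins, score)
```

A larger statistic suggests a stronger association between the discretized attribute and the class, which is the general intuition behind χ²-based feature selection. The result depends on how the attribute was binned, which is why discretization matters for both feature selection and decision tree classification.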