Article ID: | iaor20084036 |
Country: | Netherlands |
Volume: | 173 |
Issue: | 3 |
Start Page Number: | 910 |
End Page Number: | 920 |
Publication Date: | Sep 2006 |
Journal: | European Journal of Operational Research |
Authors: | Trkay Metin, ney Fadime |
Keywords: | programming: integer, datamining |
This paper presents a new data classification method based on mixed-integer programming. Traditional approaches that are based on partitioning the data sets into two groups perform poorly for multi-class data classification problems. The proposed approach is based on the use of hyper-boxes for defining boundaries of the classes that include all or some of the points in that set. A mixed-integer programming model is developed for representing existence of hyper-boxes and their boundaries. In addition, the relationships among the discrete decisions in the model are represented using propositional logic and then converted to their equivalent integer constraints using Boolean algebra. The proposed approach for multi-class data classification is illustrated on an example problem. The efficiency of the proposed method is tested on the well-known IRIS data set. The computational results on the illustrative example and the IRIS data set show that the proposed method is accurate and efficient on multi-class data classification problems.