Article ID: | iaor20081520 |
Country: | United States |
Volume: | 55 |
Issue: | 2 |
Start Page Number: | 252 |
End Page Number: | 271 |
Publication Date: | Mar 2007 |
Journal: | Operations Research |
Authors: | Bertsimas Dimitris, Shioda Romy |
Keywords: | programming: linear |
Motivated by the significant advances in integer optimization in the past decade, we introduce mixed-integer optimization methods to the classical statistical problems of classification and regression and construct a software package called CRIO (classification and regression via integer optimization). CRIO separates data points into different polyhedral regions. In classification each region is assigned a class, while in regression each region has its own distinct regression coefficients. Computational experimentations with generated and real data sets show that CRIO is comparable to and often outperforms the current leading methods in classification and regression. We hope that these results illustrate the potential for significant impact of integer optimization methods on computational statistics and data mining.