Article ID: | iaor201529953 |
Volume: | 59 |
Issue: | 4 |
Start Page Number: | 40 |
End Page Number: | 46 |
Publication Date: | Mar 2016 |
Journal: | Omega |
Authors: | Pardalos Panos M, Xanthopoulos Petros, Panagopoulos Orestis P, Pappu Vijay |
Keywords: | statistics: general, statistics: inference, optimization |
Datasets with significantly larger number of features, compared to samples, pose a serious challenge in supervised learning. Such datasets arise in various areas including business analytics. In this paper, a new binary classification method called constrained subspace classifier (CSC) is proposed for such high dimensional datasets. CSC improves on an earlier proposed classification method called local subspace classifier (LSC) by accounting for the relative angle between subspaces while approximating the classes with individual subspaces. CSC is formulated as an optimization problem and can be solved by an efficient alternating optimization technique. Classification performance is tested in publicly available datasets. The improvement in classification accuracy over LSC shows the importance of considering the relative angle between the subspaces while approximating the classes. Additionally, CSC appears to be a robust classifier, compared to traditional two step methods that perform feature selection and classification in two distinct steps.