Article ID: | iaor201525332 |
Volume: | 65 |
Issue: | 12 |
Start Page Number: | 1905 |
End Page Number: | 1919 |
Publication Date: | Dec 2014 |
Journal: | Journal of the Operational Research Society |
Authors: | Sun Jie, Li Hui, Shang Zhiming |
Keywords: | classification, support vector machines |
Previous studies on financial distress prediction (FDP) almost construct FDP models based on a balanced data set, or only use traditional classification methods for FDP modelling based on an imbalanced data set, which often results in an overestimation of an FDP model’s recognition ability for distressed companies. Our study focuses on support vector machine (SVM) methods for FDP based on imbalanced data sets. We propose a new imbalance‐oriented SVM method that combines the synthetic minority over‐sampling technique (SMOTE) with the Bagging ensemble learning algorithm and uses SVM as the base classifier. It is named as SMOTE‐Bagging‐based SVM‐ensemble (SB‐SVM‐ensemble), which is theoretically more effective for FDP modelling based on imbalanced data sets with limited number of samples. For comparative study, the traditional SVM method as well as three classical imbalance‐oriented SVM methods such as cost‐sensitive SVM, SMOTE‐SVM, and data‐set‐partition‐based SVM‐ensemble are also introduced. We collect an imbalanced data set for FDP from the Chinese publicly traded companies, and carry out 100 experiments to empirically test its effectiveness. The experimental results indicate that the new SB‐SVM‐ensemble method outperforms the traditional methods and is a useful tool for imbalanced FDP modelling.