Article ID: | iaor201112311 |
Volume: | 28 |
Issue: | 2 |
Start Page Number: | 167 |
End Page Number: | 184 |
Publication Date: | May 2011 |
Journal: | Expert Systems |
Authors: | Day Peter, Nandi Asoke K |
Keywords: | heuristics: genetic algorithms, optimization |
The success of automatic classification is intricately linked with an effective feature selection. Previous studies on the use of genetic programming (GP) to solve classification problems have highlighted its benefits, principally its inherent feature selection (a process that is often performed independent of a learning method). In this paper, the problem of classification is recast as a feature generation problem, where GP is used to evolve programs that allow non-linear combination of features to create superFeatures, from which classification tasks can be achieved fairly easily. In order to generate superFeatures robustly, the binary string fitness characterization along with the comparative partner selection strategy is introduced with the aim of promoting optimal convergence. The techniques introduced are applied to two illustrative problems first and then to the real-world problem of audio source classification, with competitive results.