Near-optimal feature selection for large databases

Near-optimal feature selection for large databases

0.00 Avg rating0 Votes
Article ID: iaor200969053
Country: United Kingdom
Volume: 60
Issue: 8
Start Page Number: 1045
End Page Number: 1055
Publication Date: Aug 2009
Journal: Journal of the Operational Research Society
Authors: ,
Keywords: combinatorial optimization, sets
Abstract:

We analyse a new optimization-based approach for feature selection that uses the nested partitions method for combinatorial optimization as a heuristic search procedure to identify good feature subsets. In particular, we show how to improve the performance of the nested partitions method using random sampling of instances. The new approach uses a two-stage sampling scheme that determines the required sample size to guarantee convergence to a near-optimal solution. This approach therefore also has attractive theoretical characteristics. In particular, when the algorithm terminates in finite time, rigorous statements can be made concerning the quality of the final feature subset. Numerical results are reported to illustrate the key results, and show that the new approach is considerably faster than the original nested partitions method and other feature selection methods.

Reviews

Required fields are marked *. Your email address will not be published.