Logical analysis of data: classification with justification

Logical analysis of data: classification with justification

0.00 Avg rating0 Votes
Article ID: iaor20117934
Volume: 188
Issue: 1
Start Page Number: 33
End Page Number: 61
Publication Date: Aug 2011
Journal: Annals of Operations Research
Authors: , , , , ,
Keywords: classification
Abstract:

Learning from examples is a frequently arising challenge, with a large number of algorithms proposed in the classification, data mining and machine learning literature. The evaluation of the quality of such algorithms is frequently carried out ex post, on an experimental basis: their performance is measured either by cross validation on benchmark data sets, or by clinical trials. Few of these approaches evaluate the learning process ex ante, on its own merits. In this paper, we discuss a property of rule‐based classifiers which we call ‘justifiability’, and which focuses on the type of information extracted from the given training set in order to classify new observations. We investigate some interesting mathematical properties of justifiable classifiers. In particular, we establish the existence of justifiable classifiers, and we show that several well‐known learning approaches, such as decision trees or nearest neighbor based methods, automatically provide justifiable classifiers. We also identify maximal subsets of observations which must be classified in the same way by every justifiable classifiers. Finally, we illustrate by a numerical example that using classifiers based on ‘most justifiable’ rules does not seem to lead to overfitting, even though it involves an element of optimization.

Reviews

Required fields are marked *. Your email address will not be published.