Article ID: | iaor20097347 |
Country: | United Kingdom |
Volume: | 59 |
Issue: | 7 |
Start Page Number: | 956 |
End Page Number: | 962 |
Publication Date: | Jul 2008 |
Journal: | Journal of the Operational Research Society |
Authors: | Hand D J, Whitrow C, Adams N M, Juszczak P, Weston D |
Keywords: | datamining, measurement |
In predictive data mining, algorithms will be both optimized and compared using a measure of predictive performance. Different measures will yield different results, and it follows that it is crucial to match the measure to the true objectives. In this paper, we explore the desirable characteristics of measures for constructing and evaluating tools for mining plastic card data to detect fraud. We define two measures, one based on minimizing the overall cost to the card company, and the other based on minimizing the amount of fraud given the maximum number of investigations the card company can afford to make. We also describe a plot, analogous to the standard ROC, for displaying the performance trace of an algorithm as the relative costs of the two different kinds of misclassification—classing a fraudulent transaction as legitimate or vice versa—are varied.