Metrics–When and Why Nonaveraging Statistics Work

Metrics–When and Why Nonaveraging Statistics Work

0.00 Avg rating0 Votes
Article ID: iaor200954108
Country: United States
Volume: 55
Issue: 1
Start Page Number: 4
End Page Number: 15
Publication Date: Jan 2009
Journal: Management Science
Authors: ,
Keywords: statistics: general
Abstract:

Good metrics are well–defined formulae (often involving averaging) that transmute multiple measures of raw numerical performance (e.g., dollar sales, referrals, number of customers) to create informative summary statistics (e.g., average share of wallet, average customer tenure). Despite myriad uses (benchmarking, monitoring, allocating resources, diagnosing problems, explanatory variables), most uses require metrics that contain information summarizing multiple observations. On this criterion, we show empirically (with people data) that although averaging has remarkable theoretical properties, supposedly inferior nonaveraging metrics (e.g., maximum, variance) are often better. We explain theoretically (with exact proofs) and numerically (with simulations) when and why. For example, when the environment causes a correlation between observed sample sizes (e.g., number of past purchases, projects, observations) and latent underlying parameters (e.g., the likelihood of favorable outcomes), the maximum statistic is a better metric than the mean. We refer to this environmental effect as the Muth effect, which occurs when rational markets provide more opportunities (i.e., more observations) to individuals and organizations with greater innate ability. Moreover, when environments are adverse (e.g., failure–rich), nonaveraging metrics correctly overweight favorable outcomes. We refer to this environmental effect as the Anna Karenina effect, which occurs when less–favorable outcomes convey less information. These environmental effects impact metric construction, selection, and employment.

Reviews

Required fields are marked *. Your email address will not be published.