Article ID: | iaor201523886 |
Volume: | 30 |
Issue: | 8 |
Start Page Number: | 1409 |
End Page Number: | 1425 |
Publication Date: | Dec 2014 |
Journal: | Quality and Reliability Engineering International |
Authors: | Anderson-Cook Christine M, Jang Dae-Heung |
Keywords: | datamining, graphs, statistics: regression |
Outliers can distort many measures for data analysis. We propose a new set of graphical summaries, called firework plots, as simple tools for evaluating the impact of outliers in data exploration and regression assessment. One variation of the plot focuses on the impact of extreme observations on the mean and standard deviation by using curves that trace the relative contribution to the overall summary as weights for individual observations are changed from 1 to 0 in a univariate data set. Similarly, other variations for bivariate data allow examination of the impact of changing weights on combinations of the correlation coefficient and mean with two‐ or three‐dimensional firework plots. One variation of the plot focuses on the impact on the estimated intercept, the estimated slope, and the estimated standard deviation by using curves based on the relative contribution to the overall summary as weights for individual observations are changed from 1 to 0 in a simple linear regression analysis. Similarly, other variations for a multiple regression allow the practitioner to examine the impact of changing weights on combinations of the estimated regression coefficients and the standard error with the pairwise firework plot matrix.