Article ID: | iaor2003428 |
Country: | United Kingdom |
Volume: | 34 |
Issue: | 4 |
Start Page Number: | 515 |
End Page Number: | 521 |
Publication Date: | Jul 2002 |
Journal: | Accident Analysis and Prevention |
Authors: | Stevenson Mark R., Wang Kui, Lee Andy H., Yau Kelvin K.W. |
Keywords: | statistics: distributions, statistics: empirical, statistics: general |
Much of the data collected on motor vehicle crashes is count data. The standard Poisson regression approach used to model this type of data does not take into account the fact that crash events are rare and hence that many of the observed counts are zero. In this paper, we applied the zero-inflated Poisson (ZIP) model, which adjusts for the many observed zeros, and the negative binomial (NB) model to analyze young driver motor vehicle crashes. The results of the ZIP regression model are comparable to those from fitting an NB regression model for general over-dispersion. The findings highlight that driver confidence, adventurousness and the frequency of driving prior to licensing are significant predictors of crash outcome in the first 12 months of driving. We encourage researchers analyzing motor vehicle crash data to examine the empirical frequency distribution first and to apply the ZIP and NB models when extra zeros are present, due, for example, to under-reporting.
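
To make the idea of zero inflation concrete, the following is a minimal sketch of the textbook ZIP probability mass function, not the fitting procedure or the data used in the paper. It assumes the usual two-part formulation: with probability pi an observation is a structural zero, and otherwise it follows a Poisson distribution with rate lam. The function names and the values lam = 0.8 and pi = 0.6 are hypothetical and chosen only for illustration.

```python
import math


def poisson_pmf(k, lam):
    """Probability of observing k events under a Poisson(lam) model."""
    return math.exp(-lam) * lam ** k / math.factorial(k)


def zip_pmf(k, lam, pi):
    """Probability of k events under a zero-inflated Poisson model.

    With probability pi the count is a structural zero; with
    probability 1 - pi it is drawn from Poisson(lam).
    """
    base = (1.0 - pi) * poisson_pmf(k, lam)
    return pi + base if k == 0 else base


def zip_mean_var(lam, pi):
    """Mean and variance implied by the ZIP model."""
    mean = (1.0 - pi) * lam
    var = (1.0 - pi) * lam * (1.0 + pi * lam)
    return mean, var


if __name__ == "__main__":
    lam, pi = 0.8, 0.6  # hypothetical values, not estimates from the paper
    mean, var = zip_mean_var(lam, pi)
    # Compare the zero probability with a plain Poisson of the same mean.
    print("ZIP mean and variance:", mean, var)
    print("P(Y = 0) under ZIP:", zip_pmf(0, lam, pi))
    print("P(Y = 0) under Poisson with equal mean:", poisson_pmf(0, mean))
```

Under these assumptions the ZIP variance exceeds its mean whenever pi and lam are both positive, and the ZIP model assigns more probability to zero than a Poisson model with the same mean. This is one way to see why a model for general over-dispersion, such as the NB model, can give comparable results on data with many zeros.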