Article ID: | iaor2012755 |
Volume: | 39 |
Issue: | 1 |
Start Page Number: | 75 |
End Page Number: | 96 |
Publication Date: | Mar 2012 |
Journal: | Scandinavian Journal of Statistics |
Authors: | Jing Junmei, Koch Inge, Naito Kanta |
Keywords: | estimation, histograms |
We consider the problem of efficiently estimating multivariate densities and their modes for moderate dimensions and an abundance of data. We propose polynomial histograms to solve this estimation problem. We present first- and second-order polynomial histogram estimators for a general d-dimensional setting. Our theoretical results include pointwise bias and variance of these estimators, their asymptotic mean integrated square error (AMISE), and optimal binwidth. The asymptotic performance of the first-order estimator matches that of the kernel density estimator, while the second order has the faster rate of O(n-6/(d+6)). For a bivariate normal setting, we present explicit expressions for the AMISE constants which show the much larger binwidths of the second order estimator and hence also more efficient computations of multivariate densities. We apply polynomial histogram estimators to real data from biotechnology and find the number and location of modes in such data.