Article ID: | iaor20002532 |
Country: | United States |
Volume: | 28 |
Issue: | 5 |
Start Page Number: | 1875 |
End Page Number: | 1905 |
Publication Date: | Jan 1999 |
Journal: | SIAM Journal On Computing |
Authors: | Parker D. Stott, Ram Prasad |
Keywords: | optimization |
We show that the space of all binary Huffman codes for a finite alphabet defines a lattice, ordered by the imbalance of the code trees. Representing code trees as path-length sequences, we show that the imbalance ordering is closely related to a majorization ordering on real-valued sequences that correspond to discrete probability density functions. Furthermore, this tree imbalance is a partial ordering that is consistent with the total orderings given by either the external path length (sum of tree path lengths) or the entropy determined by the tree structure. On the imbalance lattice, we show the weighted path-length of a tree (the usual objective function for Huffman coding) is a submodular function, as is the corresponding function on the majorization lattice. Submodular functions are discrete analogues of convex functions. These results give perspective on Huffman coding and suggest new approaches to coding as optimization over a lattice.