Article ID: | iaor2007358 |
Country: | United States |
Volume: | 16 |
Issue: | 4 |
Start Page Number: | 341 |
End Page Number: | 347 |
Publication Date: | Sep 2004 |
Journal: | INFORMS Journal On Computing |
Authors: | Park Yonil, Spouge John L. |
Keywords: | search |
The theory of the discrete-time Markovian arrival process (DMAP) can be applied to some statistical problems encountered when searching for multiple words in a Markov sequence. Such word searches are often emphasized in studies of the human genome. There are several advantages to the DMAP approach we present. Most notably, its derivations are transparent, and they readily unify disparate results about the exact distributions of overlapping and nonoverlapping word counts. We also present several examples and applications of our theory, including a numerical study using a random DNA dataset from the human genome.
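The distinction between overlapping and nonoverlapping word counts mentioned in the abstract can be illustrated with a small sketch. This is not the authors' DMAP method, only an assumed, minimal illustration of the two counting conventions for a single word; the function names and the example sequence are hypothetical.

```python
def count_overlapping(seq: str, word: str) -> int:
    """Count every position at which `word` starts, allowing overlaps."""
    count = 0
    start = seq.find(word)
    while start != -1:
        count += 1
        # Advance by one character so overlapping matches are found.
        start = seq.find(word, start + 1)
    return count


def count_nonoverlapping(seq: str, word: str) -> int:
    """Count matches scanning left to right, resuming after each match."""
    count = 0
    start = seq.find(word)
    while start != -1:
        count += 1
        # Skip past the whole match so the next one cannot overlap it.
        start = seq.find(word, start + len(word))
    return count


if __name__ == "__main__":
    seq = "AAAAGATATA"  # hypothetical toy sequence, not real genome data
    for word in ("AA", "ATA"):
        print(word, count_overlapping(seq, word), count_nonoverlapping(seq, word))
```

In the toy sequence, "AA" occurs at three overlapping positions but only twice without overlap, which is why the two counts have different distributions even for the same word and sequence model.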