Article ID: | iaor20104213 |
Volume: | 22 |
Issue: | 2 |
Start Page Number: | 195 |
End Page Number: | 209 |
Publication Date: | Mar 2010 |
Journal: | INFORMS Journal on Computing |
Authors: | Labb Martine, Catanzaro Daniele, Godi Alessandra |
Keywords: | programming: integer |
Haplotyping estimation from aligned single nucleotide polymorphism fragments has attracted increasing attention in recent years because of its importance in the analysis of fine-scale genetic data. Its application fields range from mapping of complex disease genes to inferring population histories, passing through designing drugs, functional genomics, and pharmacogenetics. The literature proposes several criteria for haplotyping populations, each of them characterized by biological motivations. One of the most important haplotyping criteria is parsimony, which consists of finding the minimum number of haplotypes necessary to explain a given set of genotypes. Parsimonious haplotype estimation is an 𝒩𝒫-hard problem for which the literature has proposed several integer programming (IP) models. Here, we describe a new polynomial-sized IP model based on the concept of class representatives, already used for the coloring problem. We propose valid inequalities to strengthen our model and show, through computational experiments, that our model outperforms the best IP models currently known in literature.