Article ID: | iaor20104212 |
Volume: | 22 |
Issue: | 2 |
Start Page Number: | 180 |
End Page Number: | 194 |
Publication Date: | Mar 2010 |
Journal: | INFORMS Journal on Computing |
Authors: | Chaovalitwongse W Art, Chou Chun-An, Berger-Wolf Tanya Y, DasGupta Bhaskar, Sheikh Saad, Ashley Mary V, Caballero Isabel C |
Keywords: | bioinformatics |
With improved tools for collecting genetic data from natural and experimental populations, new opportunities arise to study fundamental biological processes, including behavior, mating systems, adaptive trait evolution, and dispersal patterns. Full use of the newly available genetic data often depends upon reconstructing genealogical relationships of individual organisms, such as sibling reconstruction. This paper presents a new optimization framework for sibling reconstruction from single generation microsatellite genetic data. Our framework is based on assumptions of parsimony and combinatorial concepts of Mendel's inheritance rules. Here, we develop a novel optimization model for sibling reconstruction as a large-scale mixed-integer program (MIP), shown to be a generalization of the set covering problem. We propose a new heuristic approach to efficiently solve this large-scale optimization problem. We test our approach on real biological data as presented in other studies as well as simulated data, and compare our results with other state-of-the-art sibling reconstruction methods. The empirical results show that our approaches are very efficient and outperform other methods while providing the most accurate solutions for two benchmark data sets. The results suggest that our framework can be used as an analytical and computational tool for biologists to better study ecological and evolutionary processes involving knowledge of familial relationships in a wide variety of biological systems.