Article ID: | iaor20101492 |
Volume: | 174 |
Issue: | 1 |
Start Page Number: | 219 |
End Page Number: | 235 |
Publication Date: | Feb 2010 |
Journal: | Annals of Operations Research |
Authors: | Gray Genetha A, Williams Pamela J, Brown W Michael, Faulon Jean-Loup, Sale Kenneth L |
Keywords: | classification |
New challenges in knowledge extraction include interpreting and classifying data sets while simultaneously considering related information to confirm results or identify false positives. We discuss a data fusion algorithmic framework targeted at this problem. It includes separate base classifiers for each data type and a fusion method for combining the individual classifiers. The fusion method is an extension of current ensemble classification techniques and has the advantage of allowing data to remain in heterogeneous databases. In this paper, we focus on the applicability of such a framework to the protein phosphorylation prediction problem.