Article ID: | iaor2010852 |
Volume: | 37 |
Issue: | 8 |
Start Page Number: | 1381 |
End Page Number: | 1388 |
Publication Date: | Aug 2010 |
Journal: | Computers and Operations Research |
Authors: | Nascimento Mariá C V, Toledo Franklina M B, de Carvalho André C P L F |
Keywords: | heuristics |
A large amount of biological data has been produced in recent years. Important knowledge can be extracted from these data using data analysis techniques. Clustering plays an important role in data analysis by organizing similar objects from a dataset into meaningful groups. Several clustering algorithms have been proposed in the literature. However, each algorithm has its own bias, which makes it better suited to particular datasets. This paper presents a mathematical formulation to support the creation of consistent clusters for biological data. It also presents a clustering algorithm that solves this formulation using GRASP (Greedy Randomized Adaptive Search Procedure). We compared the proposed algorithm with three other well-known algorithms. The proposed algorithm produced the best clustering results, as confirmed by statistical tests.
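
The abstract does not give the paper's formulation or the details of its algorithm, so the following is only a minimal sketch of the general GRASP scheme applied to clustering, not the authors' method. It assumes a k-medoids-style objective (total Euclidean distance from each point to its nearest medoid) as a stand-in, and the names `cost`, `construct`, `local_search`, `grasp`, and the parameter `alpha` are hypothetical choices made for illustration. Each iteration builds a solution greedily but picks at random from a restricted candidate list, then improves it by local search, and the best solution over all iterations is kept.

```python
import random


def cost(points, medoids, dist):
    # Stand-in objective (an assumption, not the paper's formulation):
    # total distance from every point to its nearest medoid.
    return sum(min(dist(p, points[m]) for m in medoids) for p in points)


def construct(points, k, alpha, dist, rng):
    # Greedy randomized construction: add one medoid at a time, chosen at
    # random from the restricted candidate list (RCL) of near-best options.
    medoids = []
    while len(medoids) < k:
        candidates = [i for i in range(len(points)) if i not in medoids]
        scored = sorted((cost(points, medoids + [i], dist), i) for i in candidates)
        best, worst = scored[0][0], scored[-1][0]
        threshold = best + alpha * (worst - best)  # alpha=0 greedy, alpha=1 random
        rcl = [i for c, i in scored if c <= threshold]
        medoids.append(rng.choice(rcl))
    return medoids


def local_search(points, medoids, dist):
    # Swap neighbourhood: replace one medoid with a non-medoid whenever
    # that lowers the cost; stop when no swap improves the solution.
    current = cost(points, medoids, dist)
    improved = True
    while improved:
        improved = False
        for pos in range(len(medoids)):
            for i in range(len(points)):
                if i in medoids:
                    continue
                trial = medoids[:pos] + [i] + medoids[pos + 1:]
                c = cost(points, trial, dist)
                if c < current - 1e-12:
                    medoids, current, improved = trial, c, True
    return medoids, current


def grasp(points, k, iterations=20, alpha=0.3, seed=0):
    # Multi-start loop: construct, improve, and keep the best solution found.
    rng = random.Random(seed)

    def dist(a, b):
        return sum((x - y) ** 2 for x, y in zip(a, b)) ** 0.5

    best_medoids, best_cost = None, float("inf")
    for _ in range(iterations):
        medoids = construct(points, k, alpha, dist, rng)
        medoids, c = local_search(points, medoids, dist)
        if c < best_cost:
            best_medoids, best_cost = medoids, c
    return sorted(best_medoids), best_cost
```

Calling `grasp(points, k=2)` on a list of coordinate tuples returns the indices of the chosen medoids and the objective value. Cluster labels can then be obtained by assigning each point to its nearest medoid.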