Article ID: | iaor20041906 |
Country: | Lithuania |
Volume: | 14 |
Issue: | 3 |
Start Page Number: | 277 |
End Page Number: | 288 |
Publication Date: | Jul 2003 |
Journal: | Informatica |
Authors: | Brumen Bostjan, Golob Izidor, Welzer Tatjana, Rozman Ivan, Druzovec Marjan, Jaakkola Hannu |
Keywords: | datamining |
In the paper, we present an algorithm that can be applied to protect data before a data mining process takes place. The data mining, a part of the knowledge discovery process, is mainly about building models from data. We address the following question: can we protect the data and still allow the data modelling process to take place? We consider the case where the distributions of original data values are preserved while the values themselves change, so that the resulting model is equivalent to the one built with original data. The presented formal approach is especially useful when the knowledge discovery process is outsourced. The application of the algorithm is demonstrated through an example.