Article ID: | iaor20071040 |
Country: | United States |
Volume: | 18 |
Issue: | 2 |
Start Page Number: | 229 |
End Page Number: | 242 |
Publication Date: | Mar 2006 |
Journal: | INFORMS Journal On Computing |
Authors: | De Prabuddha, Dey Debabrata, Zhang Zhongju |
Keywords: | data warehouse |
The notion of a data warehouse for integrating operational data into a single repository is rapidly becoming popular in modern organizations. An important issue in this context is how often one should synchronize the data warehouse to reflect the changes in the constituent operational data sources. If the synchronization is performed very frequently, the associated cost might be quite high, although the data warehouse would only have a small amount of stale data. On the other hand, if the data warehouse is synchronized infrequently, it might result in costly errors in business decisions arising from the stale data. This paper examines the trade-off between the synchronization and staleness costs and derives the optimal synchronization frequency.