Optimizing path query performance: Graph clustering strategies


Article ID:	iaor20012838
Country:	United Kingdom
Volume:	8C
Issue:	1/6
Start Page Number:	381
End Page Number:	408
Publication Date:	Feb 2000
Journal:	Transportation Research. Part C, Emerging Technologies
Authors:	Huang Yun-Wu, Jing Ning, Rundensteiner Elke A.
Keywords:	geographical information systems

Abstract:

Path queries over transportation networks are operations required by many Geographic Information Systems applications. Such networks, typically modeled as graphs composed of nodes and links and represented as link relations, can be very large and hence often need to be stored on secondary storage devices. Path query computation over such large persistent networks incurs high I/O costs because links must be brought repeatedly from the link relation on secondary storage into the main memory buffer for processing. This paper is the first to present a comparative experimental evaluation of alternative graph clustering solutions in order to show their effectiveness in path query processing over transportation networks. Clustering optimization is attractive because it does not incur any run-time cost, requires no auxiliary data structures, and is complementary to many of the existing solutions for path query processing. In this paper, we develop a novel clustering technique, called spatial partition clustering (SPC), that exploits unique properties of transportation networks such as spatial coordinates and high locality. We identify other promising candidates for clustering optimizations from the literature, such as two-way partitioning and approximate topological clustering, and fine-tune them to optimize their I/O behavior for path query processing. Our experimental evaluation of these graph clustering techniques, using an actual city road network as well as randomly generated graphs, considers variations in parameters such as memory buffer size, path length, locality, and out-degree. Our experimental results are the foundation for establishing guidelines to select the best clustering technique based on the type of network.
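
The intuition behind clustering by spatial coordinates can be illustrated with a small sketch. It is not the SPC algorithm from the paper, whose details the abstract does not give; it only assumes a simple grid-cell ordering of nodes into fixed-size pages and an LRU buffer, and it counts how many page fetches a short, local path needs under two layouts. The function names `spatial_pages` and `page_reads`, the striped baseline, and all parameter values are hypothetical.

```python
from collections import OrderedDict

def spatial_pages(coords, page_size, cell):
    """Assign each node to a page by sorting nodes on their grid cell.

    Assumed scheme for illustration only: nodes in the same square cell
    of side `cell` end up adjacent in the ordering, hence on the same page.
    """
    order = sorted(
        coords,
        key=lambda n: (int(coords[n][0] // cell), int(coords[n][1] // cell), n),
    )
    return {n: i // page_size for i, n in enumerate(order)}

def page_reads(path, page_of, buffer_pages):
    """Count page fetches when visiting `path` with an LRU buffer of pages."""
    buf = OrderedDict()
    reads = 0
    for node in path:
        page = page_of[node]
        if page in buf:
            buf.move_to_end(page)        # buffer hit: mark as most recently used
        else:
            reads += 1                   # buffer miss: fetch from secondary storage
            buf[page] = True
            if len(buf) > buffer_pages:
                buf.popitem(last=False)  # evict the least recently used page
    return reads

# A 4 x 4 grid of nodes; node id = y * 4 + x, located at (x, y).
coords = {y * 4 + x: (x, y) for y in range(4) for x in range(4)}

spatial = spatial_pages(coords, page_size=4, cell=2)  # 2 x 2 blocks share a page
striped = {n: n % 4 for n in coords}                  # baseline that ignores locality

path = [0, 1, 5, 4, 8, 9]  # a short path that stays in one corner of the grid
spatial_reads = page_reads(path, spatial, buffer_pages=1)
striped_reads = page_reads(path, striped, buffer_pages=1)
print(spatial_reads, striped_reads)
```

In this toy setting the layout is fixed before any query runs, so the saving comes purely from how nodes are placed on pages, which matches the abstract's point that clustering adds no run-time cost and no auxiliary data structures.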
