Clustered chain path index for XML document: Efficiently processing branch queries

Hongqiang Wang; Jianzhong Li; Hongzhi Wang

Conference Proceedings

Clustered chain path index for XML document: Efficiently processing branch queries

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2006) 4255 LNCS 474-486

DOI: 10.1007/11912873_49

2Citations

2Readers

Get full text

Abstract

Branch query processing is a core operation of XML query processing. In recent years, a number of stack based twig join algorithms have been proposed to process twig queries based on tag stream index. However, each element is labeled separately in tag stream index, similarity of same structured elements is ignored; besides, algorithms based on tag stream index perform worse on large document. In this paper, we propose a novel index Clustered Chain Path Index (CCPI for brief) based on a novel labeling scheme: Clustered Chain Path labeling. The index provides good properties for efficiently processing branch queries. It also has the same cardinality as 1-index against tree structured XML document. Based on CCPI, we design efficient algorithms KMP-Match-Path to process queries without branches and Related-Path-Segment-Join to process queries with branches. Experimental results show that proposed query processing algorithms based on CCPI outperform other algorithms and have good scalability. © Springer-Verlag Berlin Heidelberg 2006.

Cite

CITATION STYLE

APA

Wang, H., Li, J., & Wang, H. (2006). Clustered chain path index for XML document: Efficiently processing branch queries. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 4255 LNCS, pp. 474–486). Springer Verlag. https://doi.org/10.1007/11912873_49

Clustered chain path index for XML document: Efficiently processing branch queries

Abstract

Cite

Register to see more suggestions