Fast result enumeration for keyword queries on XML data

Junfeng Zhou; Ziyang Chen; Xian Tang; Zhifeng Bao; Tok Wang Ling

Journal ArticleOPEN ACCESS

Fast result enumeration for keyword queries on XML data

Journal of Computing Science and Engineering (2012) 6(2) 127-140

DOI: 10.5626/JCSE.2012.6.2.127

2Citations

6Readers

Abstract

In this paper, we focus on efficient construction of tightest matched subtree (TMSubtree) results, for keyword queries on extensible markup language (XML) data, based on smallest lowest common ancestor (SLCA) semantics. Here, "matched" means that all nodes in a returned subtree satisfy the constraint that the set of distinct keywords of the subtree rooted at each node is not subsumed by that of any of its sibling nodes, while "tightest" means that no two subtrees rooted at two sibling nodes can contain the same set of keywords. Assume that d is the depth of a given TMSubtree, m is the number of keywords of a given query Q. We proved that if d ≤ m, a matched subtree result has at most 2m! nodes; otherwise, the size of a matched subtree result is bounded by (d - m + 2)m!. Based on this theoretical result, we propose a pipelined algorithm to construct TMSubtree results without rescanning all node labels. Experiments verify the benefits of our algorithm in aiding keyword search over XML data. © 2012. The Korean Institute of Information Scientists and Engineers.

Author supplied keywords

Cite

CITATION STYLE

APA

Zhou, J., Chen, Z., Tang, X., Bao, Z., & Ling, T. W. (2012). Fast result enumeration for keyword queries on XML data. Journal of Computing Science and Engineering, 6(2), 127–140. https://doi.org/10.5626/JCSE.2012.6.2.127

Fast result enumeration for keyword queries on XML data

Abstract

Author supplied keywords

Cite

Register to see more suggestions