A novel method for mining frequent subtrees from XML data

Wan Song Zhang; Da Xin Liu; Jian Pei Zhang

Journal Article

A novel method for mining frequent subtrees from XML data

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2004) 3177 300-305

DOI: 10.1007/978-3-540-28651-6_44

0Citations

2Readers

Get full text

Abstract

In this paper, we focus on the problem of finding frequent subtrees in a large collection of XML data, where both of the patterns and the data are modeled by labeled ordered trees. We present an efficient algorithm RSTMiner that computes all rooted subtrees appearing in a collection of XML data trees with frequent above a user-specified threshold using a special structure Me-tree. In this algorithm, Me-tree is used as a merging tree to supply scheme information for efficient pruning and mining frequent sub-trees. The keys of the algorithm are efficient pruning candidates with Me-Tree structure and incrementally enumerating all rooted sub-trees in canonical form based on a extended right most expansion technique. Experiment results show that RSTMiner algorithm is efficient and scalable. © Springer-Verlag Berlin Heidelberg 2004.

Cite

CITATION STYLE

APA

Zhang, W. S., Liu, D. X., & Zhang, J. P. (2004). A novel method for mining frequent subtrees from XML data. Lecture Notes in Computer Science (Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 3177, 300–305. https://doi.org/10.1007/978-3-540-28651-6_44

A novel method for mining frequent subtrees from XML data

Abstract

Cite

Register to see more suggestions