CMTreeminer: Mining both closed and maximal frequent subtrees

Yun Chi; Yirong Yang; Yi Xia; Richard R. Muntz

Conference Proceedings

CMTreeminer: Mining both closed and maximal frequent subtrees

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2004) 3056 63-73

DOI: 10.1007/978-3-540-24775-3_9

55Citations

20Readers

Get full text

Abstract

Tree structures are used extensively in domains such as computational biology, pattern recognition, XML databases, computer networks, and so on. One important problem in mining databases of trees is to find frequently occurring subtrees. However, because of the combinatorial explosion, the number of frequent subtrees usually grows exponentially with the size of the subtrees. In this paper, we present CMTreeMiner, a computationally efficient algorithm that discovers all closed and maximal frequent subtrees in a database of rooted unordered trees. The algorithm mines both closed and maximal frequent subtrees by traversing an enumeration tree that systematically enumerates all subtrees, while using an enumeration DAG to prune the branches of the enumeration tree that do not correspond to closed or maximal frequent subtrees. The enumeration tree and the enumeration DAG are defined based on a canonical form for rooted unordered trees–the depth-first canonical form (DFCF). We compare the performance of our algorithm with that of PathJoin, a recently published algorithm that mines maximal frequent subtrees.

Author supplied keywords

Cite

CITATION STYLE

APA

Chi, Y., Yang, Y., Xia, Y., & Muntz, R. R. (2004). CMTreeminer: Mining both closed and maximal frequent subtrees. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 3056, pp. 63–73). Springer Verlag. https://doi.org/10.1007/978-3-540-24775-3_9

CMTreeminer: Mining both closed and maximal frequent subtrees

Abstract

Author supplied keywords

Cite

Register to see more suggestions