HybridTreeMiner: An efficient algorithm for mining frequent rooted trees and free trees using canonical forms

Yun Chi; Yirong Yang; Richard R. Muntz

Conference Proceedings

HybridTreeMiner: An efficient algorithm for mining frequent rooted trees and free trees using canonical forms

Proceedings of the International Conference on Scientific and Statistical Database Management, SSDBM (2004) 16 11-20

DOI: 10.1109/ssdm.2004.1311189

97Citations

17Readers

Get full text

Abstract

Tree structures are used extensively in domains such as computational biology, pattern recognition, XML databases, computer networks, and so on. In this paper, we present HybridTreeMiner, a computationally efficient algorithm that discovers all frequently occurring subtrees in a database of rooted unordered trees. The algorithm mines frequent subtrees by traversing an enumeration tree that systematically enumerates all subtrees. The enumeration tree is defined based on a novel canonical form for rooted unordered trees-the breadth-first canonical form (BFCF). By extending the definitions of our canonical form and enumeration tree to free trees, our algorithm can efficiently handle databases of free trees as well. We study the performance of our algorithms through extensive experiments based on both synthetic data and datasets from real applications. The experiments show that our algorithm is competitive in comparison to known rooted tree mining algorithms and is faster by one to two orders of magnitudes compared to a known algorithm for mining frequent free trees.

Author supplied keywords

Cite

CITATION STYLE

APA

Chi, Y., Yang, Y., & Muntz, R. R. (2004). HybridTreeMiner: An efficient algorithm for mining frequent rooted trees and free trees using canonical forms. In Proceedings of the International Conference on Scientific and Statistical Database Management, SSDBM (Vol. 16, pp. 11–20). https://doi.org/10.1109/ssdm.2004.1311189

HybridTreeMiner: An efficient algorithm for mining frequent rooted trees and free trees using canonical forms

Abstract

Author supplied keywords

Cite

Register to see more suggestions