An efficient algorithm for mining both closed and maximal frequent free subtrees using canonical forms

3Citations
Citations of this article
4Readers
Mendeley users who have this article in their library.
Get full text

Abstract

A large number of text files, including HTML documents and XML documents, can be organized as tree structures. One objective of data mining is to discover frequent patterns in them. In this paper, first, we introduce a canonical form of free tree, which is based on the breadth-first canonical string; secondly, we present some properties of a closed frequent subtree and a maximal frequent subtree as well as their relationships; thirdly, we study a pruning technique of frequent free subtree and improvement on the mining of the nonclosed frequent free subtree; finally, we present an algorithm that mines all closed and maximal frequent free trees and prove validity of this algorithm. © Springer-Verlag Berlin Heidelberg 2005.

Cite

CITATION STYLE

APA

Guo, P., Zhou, Y., Zhuang, J., Chen, T., & Kang, Y. R. (2005). An efficient algorithm for mining both closed and maximal frequent free subtrees using canonical forms. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 3584 LNAI, pp. 96–107). Springer Verlag. https://doi.org/10.1007/11527503_13

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free