Parallel Frequent Subtrees Mining Method by an Effective Edge Division Strategy

Abstract

Most data with a complicated structure can be represented as trees. Mining frequent subtrees from massive data in a timely manner requires parallel processing, yet only a few existing algorithms can be ported to a parallel framework. This paper proposes a new parallel algorithm that mines frequent subtrees using a grouping strategy (GS) and an edge division strategy (EDS). The main idea of GS is to divide edges into different intervals and then assign the subtrees composed of the edges in each interval to the corresponding group. In addition, the compression stage of mining is optimized by avoiding the generation of all candidate subtrees of a compression tree, which reduces the mining time on each computing node. Because load balancing improves the performance of parallel computing, an effective EDS is proposed to achieve it: EDS divides edges with different frequencies into suitable intervals, which directly determines the amount of work assigned to each computing node. Experiments demonstrate that the proposed algorithm achieves parallel mining and outperforms the compared methods in load balancing and speedup.
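The abstract does not give the exact rules used by GS and EDS, so the following is only a minimal sketch of the general idea under stated assumptions. It assumes each tree is given as a list of (parent label, child label) edges, that an edge's frequency is the number of trees containing it, and that intervals are chosen so that each holds roughly an equal share of the total edge frequency. The function names `count_edge_frequencies`, `divide_edges`, and `project_trees` are hypothetical and do not come from the paper.

```python
from collections import Counter


def count_edge_frequencies(trees):
    """Count, for each edge, the number of trees that contain it."""
    freq = Counter()
    for edges in trees:
        freq.update(set(edges))  # each tree counts an edge at most once
    return freq


def divide_edges(freq, num_groups):
    """Assumed division rule (not the paper's): sort edges by descending
    frequency and cut the sorted list into contiguous intervals whose
    total frequencies are roughly equal."""
    ordered = sorted(freq.items(), key=lambda kv: (-kv[1], kv[0]))
    total = sum(freq.values())
    intervals = [[] for _ in range(num_groups)]
    running = 0  # cumulative frequency of edges placed so far
    g = 0        # index of the interval currently being filled
    for edge, f in ordered:
        # Advance once the cumulative share passes the next boundary.
        while g < num_groups - 1 and running >= total * (g + 1) / num_groups:
            g += 1
        intervals[g].append(edge)
        running += f
    return intervals


def project_trees(trees, intervals):
    """Assumed grouping rule (one possible reading of GS): each group
    receives, for every tree, the edges of that tree that fall in the
    group's interval. Empty projections are skipped."""
    edge_to_group = {e: g for g, es in enumerate(intervals) for e in es}
    groups = [[] for _ in intervals]
    for edges in trees:
        parts = [[] for _ in intervals]
        for e in edges:
            if e in edge_to_group:
                parts[edge_to_group[e]].append(e)
        for g, part in enumerate(parts):
            if part:
                groups[g].append(part)
    return groups
```

In this sketch each group could then be mined independently on its own computing node, and the per-interval frequency totals give a rough proxy for the work each node receives. The actual interval boundaries and grouping rules of the proposed method are defined in the paper and may differ.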

Cite

Wang, J., & Li, X. (2022). Parallel Frequent Subtrees Mining Method by an Effective Edge Division Strategy. Applied Sciences (Switzerland), 12(9). https://doi.org/10.3390/app12094778
