Constraint based periodic pattern mining in multiple longest common subsequences

5Citations
Citations of this article
5Readers
Mendeley users who have this article in their library.

Abstract

The periodicity search in longest common subsequences in multiple strings has a number of application, is an interesting data mining problem. Periodicity is very common practice in longest common subsequence mining algorithm. This work introduces a new parallel algorithm for finding periodicity in multiple strings. Few existing algorithms lacks in poor scalability, lacks in finding all longest pattern, and for finding symbol, partial and full periodicity. We designed the algorithm using FP-tree for finding periodicity for most common longest substring in multiple sources. We introduce a parallel algorithm for Constraint Based Periodic Pattern Mining (CBPPM) algorithm, which takes O(kN) for finding periodicity and ONLhp××()()/ time for MLCS pattern. We tested parallel algorithm on a coarse-grained multi-computer (BSP/CGM) model with pm< processors that takes ONLp×() space per processor, with Oplog() communication rounds. We derive a practical implementation that works better for arbitrary length of input sequence. The algorithm is noise resilient, and shown its performance in presence of replacement, insertion, deletion, or mixture of these types of noise. We experimented with synthetic and real data reveals a near linear speedup with scalable performance. The comparative study shows algorithm's applicability and effectiveness, generally more noise resilient.

Cite

CITATION STYLE

APA

Karthik, G. M., & Pujeri, R. V. (2013). Constraint based periodic pattern mining in multiple longest common subsequences. Indian Journal of Science and Technology, 6(8), 5046–5057. https://doi.org/10.17485/ijst/2013/v6i8.13

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free