An efficient and fast algorithm for mining frequent patterns on multiple biosequences

Wei Liu; Ling Chen

Conference ProceedingsOPEN ACCESS

An efficient and fast algorithm for mining frequent patterns on multiple biosequences

IFIP Advances in Information and Communication Technology (2011) 344 AICT(PART 1) 178-194

DOI: 10.1007/978-3-642-18333-1_22

0Citations

8Readers

Abstract

Mining frequent patterns on biosequences is one of the important research fields in biological data mining. Traditional frequent pattern mining algorithms may generate large amount of short candidate patterns in the process of mining which cost more computational time and reduce the efficiency. In order to overcome such shortcoming of the traditional algorithms, we present an algorithm named MSPM for fast mining frequent patterns on biosequences. Based on the concept of primary patterns, the algorithm focuses on longer patterns for mining in order to avoid producing lots of short patterns. Meanwhile by using prefix tree of primary frequent patterns, the algorithm can extend the primary patterns and avoid plenty of irrelevant patterns. Experimental results show that MSPM can achieve mining results efficiently and improves the performance. © 2011 IFIP International Federation for Information Processing.

Author supplied keywords

Cite

CITATION STYLE

APA

Liu, W., & Chen, L. (2011). An efficient and fast algorithm for mining frequent patterns on multiple biosequences. In IFIP Advances in Information and Communication Technology (Vol. 344 AICT, pp. 178–194). Springer New York LLC. https://doi.org/10.1007/978-3-642-18333-1_22

An efficient and fast algorithm for mining frequent patterns on multiple biosequences

Abstract

Author supplied keywords

Cite

Register to see more suggestions