The paper deals with the search and analysis of the subsequences in large volume sequences (texts, DNA sequences, etc.). A new algorithm ProMFS for mining frequent sequences is proposed and investigated. It is based on the estimated probabilistic-statistical characteristics of the appearance of elements of the sequence and their order. The algorithm builds a new much shorter sequence and makes decisions on the main sequence in accordance with the results of analysis of the shorter one.
CITATION STYLE
Tumasonis, R., & Dzemyda, G. (2006). Analysis of the Statistical Characteristics in Mining of Frequent Sequences. In Intelligent Information Processing and Web Mining (pp. 377–386). Springer-Verlag. https://doi.org/10.1007/3-540-32392-9_39
Mendeley helps you to discover research relevant for your work.