Discovery of non-induced patterns from sequences

Andrew K.C. Wong; Dennis Zhuang; Gary C.L. Li; En Shiun Annie Lee

Conference ProceedingsOPEN ACCESS

Discovery of non-induced patterns from sequences

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2010) 6282 LNBI 149-160

DOI: 10.1007/978-3-642-16001-1_13

4Citations

5Readers

Abstract

Discovering patterns from sequence data has significant impact in genomics, proteomics and business. A problem commonly encountered is that the patterns discovered often contain many redundancies resulted from fake significant patterns induced by their strong statistically significant subpatterns. The concept of statistically induced patterns is proposed to capture these redundancies. An algorithm is then developed to efficiently discover non-induced significant patterns from a large sequence dataset. For performance evaluation, two experiments were conducted to demonstrate a) the seriousness of the problem using synthetic data and b) top non-induced significant patterns discovered from Saccharomyces cerevisiae (Yeast) do correspond to the transcription factor binding sites found by the biologists. The experiments confirm the effectiveness of our method in generating a relatively small set of patterns revealing interesting, unknown information inherent in the sequences. © 2010 Springer-Verlag.

Author supplied keywords

Cite

CITATION STYLE

APA

Wong, A. K. C., Zhuang, D., Li, G. C. L., & Lee, E. S. A. (2010). Discovery of non-induced patterns from sequences. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 6282 LNBI, pp. 149–160). https://doi.org/10.1007/978-3-642-16001-1_13

Discovery of non-induced patterns from sequences

Abstract

Author supplied keywords

Cite

Register to see more suggestions