WAT: Finding top-K discords in time series database

Yingyi Bu; Tat Wing Leung; Ada Wai Chee Fu; Eamonn Keogh; Jian Pei; Sam Meshkin

Conference Proceedings

Proceedings of the 7th SIAM International Conference on Data Mining (2007) 449-454

DOI: 10.1137/1.9781611972771.43

80 Citations

57 Readers

Abstract

Finding discords in time series databases is an important problem in a wide variety of applications, such as space shuttle telemetry, mechanical industry, biomedicine, and financial data analysis. However, most previous methods for this problem require many parameters that are difficult for users to set. To our knowledge, the best-known approach with comparatively few parameters still requires users to choose a word size for compressing subsequences. In this paper, we propose an algorithm based on the Haar wavelet and an augmented trie that mines the top-K discords from a time series database and determines the word size for compression dynamically. Owing to the characteristics of the Haar wavelet transform, our algorithm has greater pruning power than previous approaches. Experiments on annotated datasets attest to both the effectiveness and the efficiency of our algorithm.
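For orientation, the two ingredients named in the abstract can be illustrated with a minimal Python sketch. It is not the authors' WAT algorithm: the augmented trie and the dynamic choice of word size are not reproduced. It assumes the common definition of a discord as the subsequence whose nearest non-overlapping neighbour is farthest away, and it assumes that top-K discords must not overlap one another. The names `haar_transform`, `prefix_distance`, and `top_k_discords` are hypothetical. The sketch shows a brute-force top-K search and the property of the orthonormal Haar transform that makes pruning possible: the distance computed on the first few coefficients never exceeds the true Euclidean distance.

```python
import math


def euclidean(a, b):
    """Euclidean distance between two equal-length sequences."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def haar_transform(values):
    """Orthonormal Haar transform; the input length must be a power of two.

    Returns the coarsest average first, followed by detail coefficients
    from coarse to fine.
    """
    n = len(values)
    if n == 0 or n & (n - 1):
        raise ValueError("length must be a power of two")
    current = [float(v) for v in values]
    details_so_far = []
    root2 = math.sqrt(2)
    while len(current) > 1:
        pairs = range(0, len(current), 2)
        averages = [(current[i] + current[i + 1]) / root2 for i in pairs]
        details = [(current[i] - current[i + 1]) / root2 for i in pairs]
        details_so_far = details + details_so_far  # coarser levels go in front
        current = averages
    return current + details_so_far


def prefix_distance(a, b, m):
    """Distance on the first m Haar coefficients: a lower bound on euclidean(a, b)."""
    return euclidean(haar_transform(a)[:m], haar_transform(b)[:m])


def top_k_discords(series, n, k):
    """Brute-force top-k discords of length n (illustrative baseline, quadratic time).

    Assumption: a discord is scored by the distance to its nearest
    non-overlapping neighbour, and reported discords do not overlap each other.
    """
    subs = [series[i:i + n] for i in range(len(series) - n + 1)]
    scored = []
    for i, s in enumerate(subs):
        candidates = [euclidean(s, t) for j, t in enumerate(subs) if abs(i - j) >= n]
        if candidates:
            scored.append((min(candidates), i))
    scored.sort(key=lambda pair: (-pair[0], pair[1]))  # largest distance first
    picked = []
    for dist, i in scored:
        if all(abs(i - j) >= n for j, _ in picked):
            picked.append((i, dist))
        if len(picked) == k:
            break
    return picked


if __name__ == "__main__":
    series = [0, 1, 0, 1] * 6
    series[13] = 5  # inject a single anomalous value
    print(top_k_discords(series, n=4, k=1))
```

In a pruning scheme of this kind, a candidate pair whose `prefix_distance` already exceeds the best nearest-neighbour distance found so far can be skipped without computing the full distance.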

APA

Bu, Y., Leung, T. W., Fu, A. W. C., Keogh, E., Pei, J., & Meshkin, S. (2007). WAT: Finding top-K discords in time series database. In Proceedings of the 7th SIAM International Conference on Data Mining (pp. 449–454). Society for Industrial and Applied Mathematics Publications. https://doi.org/10.1137/1.9781611972771.43
