The choice of an appropriate representation remains crucial for mining time series, particularly to reach a good trade-off between the dimensionality reduction and the stored information. Symbolic representations constitute a simple way of reducing the dimensionality by turning time series into sequences of symbols. SAXO is a data-driven symbolic representation of time series which encodes typical distributions of data points. This approach was first introduced as a heuristic algorithm based on a regularized coclustering approach. The main contribution of this article is to formalize SAXO as a hierarchical coclustering approach. The search for the best symbolic representation given the data is turned into a model selection problem. Comparative experiments demonstrate the benefit of the new formalization, which results in representations that drastically improve the compression of data while keeping useful information for classification tasks.
CITATION STYLE
Bondu, A., Boullé, M., & Cornuéjols, A. (2016). Symbolic representation of time series: A hierarchical coclustering formalization. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 9785 LNCS, pp. 3–16). Springer Verlag. https://doi.org/10.1007/978-3-319-44412-3_1
Mendeley helps you to discover research relevant for your work.