Deep Learning Embeddings for Data Series Similarity Search

27Citations
Citations of this article
35Readers
Mendeley users who have this article in their library.
Get full text

Abstract

A key operation for the (increasingly large) data series collection analysis is similarity search. According to recent studies, SAX-based indexes offer state-of-the-art performance for similarity search tasks. However, their performance lags under high-frequency, weakly correlated, excessively noisy, or other dataset-specific properties. In this work, we propose Deep Embedding Approximation (DEA), a novel family of data series summarization techniques based on deep neural networks. Moreover, we describe SEAnet, a novel architecture especially designed for learning DEA, that introduces the Sum of Squares preservation property into the deep network design. Finally, we propose a new sampling strategy, SEASam, that allows SEAnet to effectively train on massive datasets. Comprehensive experiments on 7 diverse synthetic and real datasets verify the advantages of DEA learned using SEAnet, when compared to other state-of-the-art traditional and DEA solutions, in providing high-quality data series summarizations and similarity search results.

Cite

CITATION STYLE

APA

Wang, Q., & Palpanas, T. (2021). Deep Learning Embeddings for Data Series Similarity Search. In Proceedings of the ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (pp. 1708–1716). Association for Computing Machinery. https://doi.org/10.1145/3447548.3467317

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free