Towards Faster Similarity Search by Dynamic Reordering of Streamed Queries

0Citations
Citations of this article
1Readers
Mendeley users who have this article in their library.
Get full text

Abstract

Current era of digital data explosion calls for employment of content-based similarity search techniques, since traditional searchable metadata like annotations are not always available. In our work, we focus on a scenario where the similarity search is used in the context of stream processing, which is one of the suitable approaches to deal with huge amounts of data. Our goal is to maximize the throughput of processed queries while a slight delay is acceptable. We propose a technique that dynamically reorders the queries coming from the stream in order to use our caching mechanism in huge data spaces more effectively. We were able to achieve significantly higher throughput compared to the baseline when no reordering and no caching were used. Moreover, our proposal does not incur any additional precision loss of the similarity search, as opposed to some other caching techniques. In addition to the throughput maximization, we also study the potential of trading off the throughput for low delays (waiting times). The proposed technique allows to be parameterized by the amount of the throughput that can be sacrificed.

Author supplied keywords

Cite

CITATION STYLE

APA

Nalepa, F., Batko, M., & Zezula, P. (2018). Towards Faster Similarity Search by Dynamic Reordering of Streamed Queries. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 11250 LNCS, pp. 61–88). Springer Verlag. https://doi.org/10.1007/978-3-662-58384-5_3

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free