Redundant bit vectors for quickly searching high-dimensional regions

17Citations
Citations of this article
38Readers
Mendeley users who have this article in their library.
Get full text

Abstract

Applications such as audio fingerprinting require search in high dimensions: find an item in a database that is similar to a query. An important property of this search task is that negative answers are very frequent: much of the time, a query does not correspond to any database item. We propose Redundant Bit Vectors (RBVs): a novel method for quickly solving this search problem. RBVs rely on three key ideas: 1) approximate the high-dimensional regions/distributions as tightened hyperrectangles, 2) partition the query space to store each item redundantly in an index and 3) use bit vectors to store and search the index efficiently. We show that our method is the preferred method for very large databases or when the queries are often not in the database. Our method is 109 times faster than linear scan, and 48 times faster than locality-sensitive hashing on a data set of 239369 audio fingerprints. © Springer-Verlag Berlin Heidelberg 2005.

Cite

CITATION STYLE

APA

Goldstein, J., Platt, J. C., & Burges, C. J. C. (2005). Redundant bit vectors for quickly searching high-dimensional regions. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 3635 LNAI, pp. 137–158). https://doi.org/10.1007/11559887_9

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free