Multiple binary codes for fast approximate similarity search

Shinichi Shirakawa

Journal ArticleOPEN ACCESS

Multiple binary codes for fast approximate similarity search

Shirakawa S

IEICE Transactions on Information and Systems (2015) E98D(3) 671-680

DOI: 10.1587/transinf.2014EDP7212

0Citations

6Readers

Abstract

One of the fast approximate similarity search techniques is a binary hashing method that transforms a real-valued vector into a binary code. The similarity between two binary codes is measured by their Hamming distance. In this method, a hash table is often used when undertaking a constant-time similarity search. The number of accesses to the hash table, however, increases when the number of bits lengthens. In this paper, we consider a method that does not access data with a long Hamming radius by using multiple binary codes. Further, we attempt to integrate the proposed approach and the existing multi-index hashing (MIH) method to accelerate the performance of the similarity search in the Hamming space. Then, we propose a learning method of the binary hash functions for multiple binary codes. We conduct an experiment on similarity search utilizing a dataset of up to 50 million items and show that our proposed method achieves a faster similarity search than that possible with the conventional linear scan and hash table search.

Author supplied keywords

References Powered by Scopus

View more at Scopus

Cite

CITATION STYLE

APA

Shirakawa, S. (2015). Multiple binary codes for fast approximate similarity search. IEICE Transactions on Information and Systems, E98D(3), 671–680. https://doi.org/10.1587/transinf.2014EDP7212

Readers' Seniority

PhD / Post grad / Masters / Doc 3

75%

Lecturer / Post doc 1

25%

Readers' Discipline

Computer Science 4

80%

Energy 1

20%

Multiple binary codes for fast approximate similarity search

Abstract

Author supplied keywords

References Powered by Scopus

Locality-sensitive hashing scheme based on p-stable distributions

Similarity estimation techniques from rounding algorithms

Supervised hashing with kernels

Register to see more suggestions

Cite

Readers' Seniority

Readers' Discipline