Principal component hashing: An accelerated approximate nearest neighbor search

Yusuke Matsushita; Toshikazu Wada

Conference ProceedingsOPEN ACCESS

Principal component hashing: An accelerated approximate nearest neighbor search

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2009) 5414 LNCS 374-385

DOI: 10.1007/978-3-540-92957-4_33

23Citations

11Readers

Abstract

Nearest Neighbor (NN) search is a basic algorithm for data mining and machine learning applications. However, its acceleration in high dimensional space is a difficult problem. For solving this problem, approximate NN search algorithms have been investigated. Especially, LSH is getting highlighted recently, because it has a clear relationship between relative error ratio and the computational complexity. However, the p-stable LSH computes hash values independent of the data distributions, and hence, sometimes the search fails or consumes considerably long time. For solving this problem, we propose Principal Component Hashing (PCH), which exploits the distribution of the stored data. Through experiments, we confirmed that PCH is faster than ANN and LSH at the same accuracy. © 2009 Springer Berlin Heidelberg.

Author supplied keywords

Cite

CITATION STYLE

APA

Matsushita, Y., & Wada, T. (2009). Principal component hashing: An accelerated approximate nearest neighbor search. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 5414 LNCS, pp. 374–385). https://doi.org/10.1007/978-3-540-92957-4_33

Principal component hashing: An accelerated approximate nearest neighbor search

Abstract

Author supplied keywords

Cite

Register to see more suggestions