S2JSD-LSH: A locality-sensitive hashing schema for probability distributions

6Citations
Citations of this article
17Readers
Mendeley users who have this article in their library.

Abstract

To compare the similarity of probability distributions, the information-theoretically motivated metrics like Kullback-Leibler divergence (KL) and Jensen-Shannon divergence (JSD) are often more reasonable compared with metrics for vectors like Euclidean and angular distance. However, existing locality-sensitive hashing (LSH) algorithms cannot support the information-theoretically motivated metrics for probability distributions. In this paper, we first introduce a new approximation formula for S2JSD-distance, and then propose a novel LSH scheme adapted to S2JSD-distance for approximate nearest neighbors search in high-dimensional probability distributions. We define the specific hashing functions, and prove their local-sensitivity. Furthermore, extensive empirical evaluations well illustrate the effectiveness of the proposed hashing schema on six public image datasets and two text datasets, in terms of mean Average Precision, Precision@N and Precision-Recall curve.

Cite

CITATION STYLE

APA

Mao, X. L., Feng, B. S., Hao, Y. J., Nie, L., Huang, H., & Wen, G. (2017). S2JSD-LSH: A locality-sensitive hashing schema for probability distributions. In 31st AAAI Conference on Artificial Intelligence, AAAI 2017 (pp. 3244–3251). AAAI press. https://doi.org/10.1609/aaai.v31i1.10989

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free