This paper addresses the problem of indexing high dimensional normalized histogram data, i.e., D-dimensional feature vectors H where ∑i=1D Hi = 1. These are often used as representations for multimedia objects in order to facilitate similarity query processing. By analyzing properties that are induced by the above constraint and that do not hold in general multi-dimensional spaces we design a new split policy. We show that the performance of similarity queries for normalized histogram data can be significantly improved by exploiting such properties within a simple indexing framework. We are able to process nearest-neighbor queries up to 10 times faster than the SR-tree and 3 times faster than the A-tree. © Springer-Verlag Berlin Heidelberg 2003.
CITATION STYLE
Coman, A., Sander, J., & Nascimento, M. A. (2003). Efficient indexing of high dimensional normalized histograms. Lecture Notes in Computer Science (Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 2736, 601–610. https://doi.org/10.1007/978-3-540-45227-0_59
Mendeley helps you to discover research relevant for your work.