Refinements to nearest-neighbor searching in k-dimensional trees

Robert F. Sproull

Journal Article

Refinements to nearest-neighbor searching in k-dimensional trees

Sproull R

Algorithmica (1991) 6(1-6) 579-589

DOI: 10.1007/BF01759061

293Citations

109Readers

Get full text

Abstract

This note presents a simplification and generalization of an algorithm for searching k-dimensional trees for nearest neighbors reported by Friedman et al [3]. If the distance between records is measured using L2, the Euclidean norm, the data structure used by the algorithm to determine the bounds of the search space can be simplified to a single number. Moreover, because distance measurements in L2 are rotationally invariant, the algorithm can be generalized to allow a partition plane to have an arbitrary orientation, rather than insisting that it be perpendicular to a coordinate axis, as in the original algorithm. When a k-dimensional tree is built, this plane can be found from the principal eigenvector of the covariance matrix of the records to be partitioned. These techniques and others yield variants of k-dimensional trees customized for specific applications. It is wrong to assume that k-dimensional trees guarantee that a nearest-neighbor query completes in logarithmic expected time. For small k, logarithmic behavior is observed on all but tiny trees. However, for larger k, logarithmic behavior is achievable only with extremely large numbers of records. For k = 16, a search of a k-dimensional tree of 76,000 records examines almost every record. © 1991 Springer-Verlag New York Inc.

Author supplied keywords

Cite

CITATION STYLE

APA

Sproull, R. F. (1991). Refinements to nearest-neighbor searching in k-dimensional trees. Algorithmica, 6(1–6), 579–589. https://doi.org/10.1007/BF01759061

Refinements to nearest-neighbor searching in k-dimensional trees

Abstract

Author supplied keywords

Cite

Register to see more suggestions