Basic triangle inequality approach versus metric VP-tree and projection in determining euclidean and cosine neighbors

Marzena Kryszkiewicz; Bartłomiej Jańczak

Journal Article

Basic triangle inequality approach versus metric VP-tree and projection in determining euclidean and cosine neighbors

Studies in Computational Intelligence (2014) 541 27-49

DOI: 10.1007/978-3-319-04714-0_3

2Citations

1Readers

Get full text

Abstract

The Euclidean distance and the cosine similarity are often applied for clustering or classifying objects or simply for determining most similar objects or nearest neighbors. In fact, the determination of nearest neighbors is typically a subtask of both clustering and classification. In this chapter, we discuss three principal approaches to efficient determination of nearest neighbors: namely, using the triangle inequality when vectors are ordered with respect to their distances to one reference vector, using a metric VP-tree and using a projection onto a dimension. Also, we discuss a combined application of a number of reference vectors and/or projections onto dimensions and compare two variants of VP-tree. The techniques are well suited to any distance metrics such as the Euclidean distance, but they cannot be directly used for searching nearest neighbors with respect to the cosine similarity. However, we have shown recently that the problem of determining a cosine similarity neighborhood can be transformed to the problem of determining a Euclidean neighborhood among normalized forms of original vectors. In this chapter, we provide an experimental comparison of the discussed techniques for determining nearest neighbors with regard to the Euclidean distance and the cosine similarity.

Author supplied keywords

Cite

CITATION STYLE

APA

Kryszkiewicz, M., & Jańczak, B. (2014). Basic triangle inequality approach versus metric VP-tree and projection in determining euclidean and cosine neighbors. Studies in Computational Intelligence, 541, 27–49. https://doi.org/10.1007/978-3-319-04714-0_3

Basic triangle inequality approach versus metric VP-tree and projection in determining euclidean and cosine neighbors

Abstract

Author supplied keywords

Cite

Register to see more suggestions