Basic triangle inequality approach versus metric VP-tree and projection in determining euclidean and cosine neighbors

2Citations
Citations of this article
1Readers
Mendeley users who have this article in their library.
Get full text

Abstract

The Euclidean distance and the cosine similarity are often applied for clustering or classifying objects or simply for determining most similar objects or nearest neighbors. In fact, the determination of nearest neighbors is typically a subtask of both clustering and classification. In this chapter, we discuss three principal approaches to efficient determination of nearest neighbors: namely, using the triangle inequality when vectors are ordered with respect to their distances to one reference vector, using a metric VP-tree and using a projection onto a dimension. Also, we discuss a combined application of a number of reference vectors and/or projections onto dimensions and compare two variants of VP-tree. The techniques are well suited to any distance metrics such as the Euclidean distance, but they cannot be directly used for searching nearest neighbors with respect to the cosine similarity. However, we have shown recently that the problem of determining a cosine similarity neighborhood can be transformed to the problem of determining a Euclidean neighborhood among normalized forms of original vectors. In this chapter, we provide an experimental comparison of the discussed techniques for determining nearest neighbors with regard to the Euclidean distance and the cosine similarity.

Cite

CITATION STYLE

APA

Kryszkiewicz, M., & Jańczak, B. (2014). Basic triangle inequality approach versus metric VP-tree and projection in determining euclidean and cosine neighbors. Studies in Computational Intelligence, 541, 27–49. https://doi.org/10.1007/978-3-319-04714-0_3

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free