Skip to content
Conference proceedings

Metric index: An efficient and scalable solution for similarity search

Novak D, Batko M...(+2 more)

2009 2nd International Workshop on Similarity Search and Applications, SISAP 2009 (2009) pp. 65-73

  • 23


    Mendeley users who have this article in their library.
  • 25


    Citations of this article.
  • N/A


    ScienceDirect users who have downloaded this article.
Sign in to save reference


Metric space as a universal and versatile model of similarity can be applied in various areas of non-text information retrieval. However, a general, efficient and scalable solution for metric data management is still a resisting research challenge. We introduce a novel indexing and searching mechanism called metric index (M-Index), that employs practically all known principles of metric space partitioning, pruning and filtering. The heart of the M-Index is a general mapping mechanism that enables to actually store the data in well-established structures such as the B+-tree or even in a distributed storage. We have implemented the M-Index with B+-tree and performed experiments on a combination of five MPEG-7 descriptors in a database of hundreds of thousands digital images. The experiments put under test several M-Index variants and compare them with two orthogonal approaches - the PM-Tree and the iDistance. The trials show that the M-Index outperforms the others in terms of efficiency of search-space pruning, I/O costs, and response times for precise similarity queries. Furthermore, the M-Index demonstrates an excellent ability to keep similar data close in the index which makes its approximation algorithm very efficient-maintaining practically constant response times while preserving a very high recall as the dataset grows.

Author-supplied keywords

  • Approximation
  • Data structure
  • Metric space
  • Scalability
  • Similarity search

Find this document

Get full text

Cite this document

Choose a citation style from the tabs below