Scalable high-dimensional indexing with Hadoop

6Citations
Citations of this article
25Readers
Mendeley users who have this article in their library.
Get full text

Abstract

While high-dimensional search-by-similarity techniques reached their maturity and in overall provide good performance, most of them are unable to cope with very large multimedia collections. The 'big data' challenge however has to be addressed as multimedia collections have been explosively growing and will grow even faster than ever within the next few years. Luckily, computational processing power has become more available to researchers due to easier access to distributed grid infrastructures. In this paper, we show how high-dimensional indexing methods can be used on scientific grid environments and present a scalable workflow for indexing and searching over 30 billion SIFT descriptors using a cluster running Hadoop. Our findings could help other researchers and practitioners to cope with huge multimedia collections. © 2013 IEEE.

Cite

CITATION STYLE

APA

Shestakov, D., Moise, D., Gudmundsson, G., & Amsaleg, L. (2013). Scalable high-dimensional indexing with Hadoop. In Proceedings - International Workshop on Content-Based Multimedia Indexing (pp. 207–212). https://doi.org/10.1109/CBMI.2013.6576584

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free