Comprehensive Characterization of an Open Source Document Search Engine

3Citations
Citations of this article
16Readers
Mendeley users who have this article in their library.

Abstract

This work performs a thorough characterization and analysis of the open source Lucene search library. The article describes in detail the architecture, functionality, and micro-Architectural behavior of the search engine, and investigates prominent online document search research issues. In particular, we study how intra-server index partitioning affects the response time and throughput, explore the potential use of low power servers for document search, and examine the sources of performance degradation ands the causes of tail latencies. Some of our main conclusions are the following: (a) intra-server index partitioning can reduce tail latencies but with diminishing benefits as incoming query traffic increases, (b) low power servers given enough partitioning can provide same average and tail response times as conventional high performance servers, (c) index search is a CPU-intensive cache-friendly application, and (d) C-states are the main culprits for performance degradation in document search.

Cite

CITATION STYLE

APA

Hadjilambrou, Z., Kleanthous, M., Antoniou, G., Portero, A., & Sazeides, Y. (2019). Comprehensive Characterization of an Open Source Document Search Engine. ACM Transactions on Architecture and Code Optimization, 16(2). https://doi.org/10.1145/3320346

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free