Implementation of a system for fast text search and document comparison

1Citations
Citations of this article
3Readers
Mendeley users who have this article in their library.
Get full text

Abstract

This chapter presents an architecture of the system for fast text search and documents comparison with main focus on N-gram-based algorithm and its parallel implementation. The algorithm which is one of several computational procedures implemented in the system is used to generate a fingerprint of analyzed documents as a set of hashes which represent the file. This work examines the performance of the system, both in terms of a file comparison quality and a fingerprint generation. Several tests were conducted of N-gram-based algorithm for Intel Xeon E5645, 2.40 GHz which show approximately 8x speedup of multi over single core implementation.

Cite

CITATION STYLE

APA

Wielgosz, M., Janiszewski, M., Russek, P., Pietro, M., Jamro, E., & Wiatr, K. (2014). Implementation of a system for fast text search and document comparison. Studies in Computational Intelligence, 541, 173–186. https://doi.org/10.1007/978-3-319-04714-0_11

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free