Recursive hashing functions for n-grams

Abstract

Many indexing, retrieval, and comparison methods are based on counting or cataloguing n-grams in streams of symbols. The fastest method of implementing such operations is through the use of hash tables. Rapid hashing of consecutive n-grams is best done using a recursive hash function, in which the hash value of the current n-gram is derived from the hash value of its predecessor. This article generalizes recursive hash functions found in the literature and proposes new methods offering superior performance. Experimental results demonstrate substantial speed improvement over conventional approaches, while retaining near-ideal hash value distribution. © 1997 ACM.
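To make the idea of a recursive hash concrete, here is a minimal sketch of one well-known variant, a polynomial rolling hash in the Karp-Rabin style. It is not taken from the article and is not claimed to be one of the methods it proposes. The names `BASE`, `MOD`, `direct_hash`, `rolling_hashes`, and `count_ngrams` and the constants chosen are assumptions made for illustration only.

```python
from collections import Counter

# Illustrative constants (assumptions, not values from the article).
BASE = 257            # multiplier, larger than any byte value
MOD = (1 << 61) - 1   # a large Mersenne prime used as the modulus


def direct_hash(gram: bytes) -> int:
    """Hash one n-gram from scratch: O(n) work per n-gram."""
    h = 0
    for b in gram:
        h = (h * BASE + b) % MOD
    return h


def rolling_hashes(data: bytes, n: int):
    """Yield the hash of every consecutive n-gram in data.

    After the first n-gram, each hash is derived from its predecessor
    in O(1): remove the outgoing symbol, shift, add the incoming symbol.
    """
    if n <= 0 or len(data) < n:
        return
    top = pow(BASE, n - 1, MOD)      # weight of the oldest symbol
    h = direct_hash(data[:n])
    yield h
    for i in range(n, len(data)):
        h = (h - data[i - n] * top) % MOD   # drop the outgoing symbol
        h = (h * BASE + data[i]) % MOD      # shift and add the incoming one
        yield h


def count_ngrams(data: bytes, n: int) -> Counter:
    """Count n-grams keyed by hash value (colliding n-grams are merged)."""
    return Counter(rolling_hashes(data, n))
```

Each update costs a constant number of arithmetic operations regardless of n, whereas rehashing every n-gram from scratch costs n operations each. Counting by hash value alone merges any n-grams that collide, so an application that needs exact counts would also have to compare the underlying symbols.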

Cohen, J. D. (1997). Recursive hashing functions for n-grams. ACM Transactions on Information Systems, 15(3), 291–320. https://doi.org/10.1145/256163.256168
