A fast generative spell corrector based on edit distance

Ishan Chattopadhyaya; Kannappan Sirchabesan; Krishanu Seal

Conference Proceedings

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2013) 7814 LNCS 404-410

DOI: 10.1007/978-3-642-36973-5_34

Abstract

One of the main challenges in implementing web-scale online search systems is disambiguating user input when portions of a query may be misspelt. Spell correctors integrated with such systems face stringent requirements: above all, they must handle large volumes of concurrent queries and generate relevant spelling suggestions at very high speed. These systems often run on high-end server machines with ample memory and processing power, and the requirement for such spell correctors is to keep the latency of generating suggestions to a bare minimum. In this paper, we present a spell corrector that we developed to serve high-volume incoming queries for an online search service. It consists of a fast, per-token candidate generator that produces spelling suggestions within two edit operations of an input token. We compare its performance against an n-gram based spell corrector and show that the presented candidate generation approach has lower response times. © 2013 Springer-Verlag.
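To make the idea of per-token candidate generation within two edit operations concrete, here is a minimal Python sketch. It is not the authors' implementation, which the abstract does not describe. It assumes a lowercase ASCII alphabet, a vocabulary held in an in-memory set, and an edit model that counts insertion, deletion, substitution, and adjacent transposition as one operation each. The names `edits1` and `candidates` are hypothetical.

```python
import string

# Assumption: tokens are lowercase ASCII; the paper's alphabet is not stated.
ALPHABET = string.ascii_lowercase


def edits1(token):
    """Return every string exactly one edit operation away from `token`."""
    splits = [(token[:i], token[i:]) for i in range(len(token) + 1)]
    deletes = {left + right[1:] for left, right in splits if right}
    transposes = {
        left + right[1] + right[0] + right[2:]
        for left, right in splits
        if len(right) > 1
    }
    substitutes = {
        left + ch + right[1:] for left, right in splits if right for ch in ALPHABET
    }
    inserts = {left + ch + right for left, right in splits for ch in ALPHABET}
    return deletes | transposes | substitutes | inserts


def candidates(token, vocabulary, max_distance=2):
    """Map each vocabulary word reachable within `max_distance` edits
    to the smallest number of edits at which it was generated."""
    found = {token: 0} if token in vocabulary else {}
    seen = {token}
    frontier = {token}
    for distance in range(1, max_distance + 1):
        # Expand one more edit, skipping strings already generated earlier.
        frontier = {e for t in frontier for e in edits1(t)} - seen
        seen |= frontier
        for word in frontier & vocabulary:
            found[word] = distance
    return found


if __name__ == "__main__":
    vocab = {"search", "query", "corrector"}  # hypothetical toy vocabulary
    print(candidates("serach", vocab))
    print(candidates("srch", vocab))
```

This brute-force expansion generates on the order of tens of thousands of strings for a short token at distance two, so a production system aiming at minimal latency would presumably prune or index the search. The sketch only illustrates what "generative" candidate production means here.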

Cite

Chattopadhyaya, I., Sirchabesan, K., & Seal, K. (2013). A fast generative spell corrector based on edit distance. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 7814 LNCS, pp. 404–410). https://doi.org/10.1007/978-3-642-36973-5_34
