Word similarity calculation by using the edit distance metrics with consonant normalization

Abstract

Edit distance metrics are widely used in applications such as string comparison and spelling error correction. Hamming distance is a metric for two strings of equal length, and Damerau-Levenshtein distance is a well-known metric for spelling correction through string-to-string comparison. These metrics seem appropriate for alphabetic languages such as English and other European languages. However, the conventional edit distance criterion is not the best method for agglutinative languages like Korean. The reason is that two or more letter units make up a Korean character, which is called a syllable. This syllable-based word construction makes edit distance calculation inefficient for Korean. We therefore explore a new edit distance method that uses consonant normalization and a normalization factor.
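
To make the syllable issue concrete, the following is a minimal sketch, not the method proposed in the paper. It computes a plain Levenshtein distance (no transpositions, so it is simpler than Damerau-Levenshtein) over whole Korean syllables and then over the letter units obtained with Unicode NFD decomposition. The function names `levenshtein` and `to_letters` and the two sample strings are illustrative assumptions. Consonant normalization and the normalization factor are not shown, because the abstract does not define them.

```python
import unicodedata


def levenshtein(a, b):
    """Plain Levenshtein distance: insertions, deletions, substitutions."""
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        curr = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            curr.append(min(prev[j] + 1,          # deletion
                            curr[j - 1] + 1,      # insertion
                            prev[j - 1] + cost))  # substitution or match
        prev = curr
    return prev[-1]


def to_letters(word):
    """Split precomposed Hangul syllables into their letter units (jamo).

    Assumption for this sketch: NFD decomposition stands in for whatever
    letter-level representation the paper actually uses.
    """
    return unicodedata.normalize("NFD", word)


# Arbitrary sample strings: each syllable pair differs only in its
# final consonant.
w1, w2 = "한국", "항굴"

syllable_distance = levenshtein(w1, w2)
letter_distance = levenshtein(to_letters(w1), to_letters(w2))

print(syllable_distance, "edits over", len(w1), "syllables")
print(letter_distance, "edits over", len(to_letters(w1)), "letters")
```

At the syllable level both syllables count as substituted, so the two strings look entirely different. At the letter level only two of six units differ, which better reflects how close the strings are.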

Cite

Kang, S. S. (2015). Word similarity calculation by using the edit distance metrics with consonant normalization. Journal of Information Processing Systems, 11(4), 573–582. https://doi.org/10.3745/JIPS.04.0018