On morphological relatedness

Ahmed Khorsi

Journal Article

On morphological relatedness

Khorsi A

Natural Language Engineering (2013) 19(4) 537-555

DOI: 10.1017/S1351324912000071

6Citations

9Readers

Get full text

Abstract

In this paper, we discuss the results of a new unsupervised and computationally lightweight scoring of how two words are morphologically related to each other. This measure is meant to be an alternative to stemming, radicals (root) extraction, and morphological analysis in a wide range of applications; especially information extraction related ones. Compared to light stemming, which seems to be the most convenient approach for systems with efficiency concerns, our measure does not neglect unconditionally a prefix or a suffix as the light stemming does. Instead, our measure takes into account all letters of the word but with different weights. This prevents the missing of a significant letter. Compared to heavy stemming, morphological analysis, or radicals extraction, which rely on dictionaries and compatibility databases, our measure does not rely on any language-specific morphology knowledge. This makes our approach unsupervised and theoretically language independent and computationally much lighter. Our tests targeted Arabic: a Semitic language recognized to have a complex morphology due to its highly inflectional lexicon. Copyright © 2012 Cambridge University Press.

Cite

CITATION STYLE

APA

Khorsi, A. (2013). On morphological relatedness. Natural Language Engineering, 19(4), 537–555. https://doi.org/10.1017/S1351324912000071

On morphological relatedness

Abstract

Cite

Register to see more suggestions