This paper presents a method for measuring the compositionality score of multiword expressions (MWEs). Based on Wikipedia (WP) as a lexicon resource, the multiword expressions are identified using the title of Wikipedia articles that are made up of more than one word without further process. Through the semantic representation, this method exploits the hierarchical taxonomy in Wikipedia to represent the concept (single word or multiword) as a feature vector containing the WP articles that belong to concept of categories and sub-categories. The literality and the multiplicative function composition scores are used for measuring the compositionality score of an MWE utilizing the semantic similarity. The proposed method is evaluated by comparing the compositionality score against human judgments (dataset) containing 100 Arabic noun-noun compounds. © Springer-Verlag Berlin Heidelberg 2013.
CITATION STYLE
Saif, A., Ab Aziz, M. J., & Omar, N. (2013). Measuring the Compositionality of Arabic Multiword Expressions. In Communications in Computer and Information Science (Vol. 378 CCIS, pp. 245–256). Springer Verlag. https://doi.org/10.1007/978-3-642-40567-9_21
Mendeley helps you to discover research relevant for your work.