Tunisian dialect Wordnet creation and enrichment using web resources and other Wordnets

Rihab Bouchlaghem; Aymen Elkhlifi; Rim Faiz

Conference Proceedings

ANLP 2014 - EMNLP 2014 Workshop on Arabic Natural Language Processing, Proceedings (2014) 104-113

DOI: 10.3115/v1/w14-3613

Abstract

In this paper, we propose TunDiaWN (Tunisian dialect Wordnet), a lexical resource for the dialect spoken in Tunisia. Our TunDiaWN construction approach is founded, on the one hand, on a corpus-based method to analyze and extract Tunisian dialect words. A clustering technique is adapted and applied to mine the possible relations between the extracted Tunisian dialect words and to group them into meaningful clusters. These suggestions are then evaluated and validated by experts to perform the resource enrichment task. On the other hand, we reuse other Wordnet versions, mainly for English and Arabic, to propose a new database structure enriched with innovative features and entities.

Bouchlaghem, R., Elkhlifi, A., & Faiz, R. (2014). Tunisian dialect Wordnet creation and enrichment using web resources and other Wordnets. In ANLP 2014 - EMNLP 2014 Workshop on Arabic Natural Language Processing, Proceedings (pp. 104–113). Association for Computational Linguistics (ACL). https://doi.org/10.3115/v1/w14-3613
