Related terms extraction from Arabic news corpus using word embedding

Amina Chouigui; Oussama Ben Khiroun; Bilel Elayeb

Conference Proceedings

Related terms extraction from Arabic news corpus using word embedding

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2019) 11231 LNCS 230-240

DOI: 10.1007/978-3-030-11683-5_26

4Citations

8Readers

Get full text

Abstract

Different techniques are used in text mining to analyze data, extract knowledge, information and relations. We aim in this work to extract related terms for specific keywords. In the first step, we extract Arabic keywords from news articles titles using the TF-IDF terms weighting measure. In the next step, we extract the related terms, from both titles and main texts, using Word2Vec model as a word embedding technique. In order to evaluate our proposed approach, we compute the precision values of the extracted terms that are present in Wikipedia articles. The experiments results perform better for the extracted terms from the articles main texts than titles and the international news category has the highest precision value.

Author supplied keywords

Cite

CITATION STYLE

APA

Chouigui, A., Ben Khiroun, O., & Elayeb, B. (2019). Related terms extraction from Arabic news corpus using word embedding. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 11231 LNCS, pp. 230–240). Springer Verlag. https://doi.org/10.1007/978-3-030-11683-5_26

Related terms extraction from Arabic news corpus using word embedding

Abstract

Author supplied keywords

Cite

Register to see more suggestions