Related terms extraction from Arabic news corpus using word embedding

4Citations
Citations of this article
8Readers
Mendeley users who have this article in their library.
Get full text

Abstract

Different techniques are used in text mining to analyze data, extract knowledge, information and relations. We aim in this work to extract related terms for specific keywords. In the first step, we extract Arabic keywords from news articles titles using the TF-IDF terms weighting measure. In the next step, we extract the related terms, from both titles and main texts, using Word2Vec model as a word embedding technique. In order to evaluate our proposed approach, we compute the precision values of the extracted terms that are present in Wikipedia articles. The experiments results perform better for the extracted terms from the articles main texts than titles and the international news category has the highest precision value.

Cite

CITATION STYLE

APA

Chouigui, A., Ben Khiroun, O., & Elayeb, B. (2019). Related terms extraction from Arabic news corpus using word embedding. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 11231 LNCS, pp. 230–240). Springer Verlag. https://doi.org/10.1007/978-3-030-11683-5_26

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free