A text semantic similarity approach for Arabic paraphrase detection

18Citations
Citations of this article
19Readers
Mendeley users who have this article in their library.
Get full text

Abstract

The main challenge of paraphrase is how to detect the semantic relationship between the suspect text document and the source text document. Nowadays, the combination of Natural Language Processing NLP and deep learning based approaches have a booming in the field of text analysis, including: text classification, machine translation, text similarity detection, etc. In this context, we proposed a deep learning based method to detect Arabic paraphrase composed by the following phases: First, we started with a preprocessing phase by extracting the relevant information from text document. Then, word2vec algorithm was used to generate word vectors representation which they would be combined subsequently to generate a sentence vectors representation. Finally, we used a Convolutional Neural Network CNN to improve the ability to capture statistical regularities in the context of sentences which then makes it possible to facilitate the similarity measurement operation between the representations of source and suspicious sentences. The evaluation of our proposed approach gave us a promising result in term of precision.

Cite

CITATION STYLE

APA

Mahmoud, A., Zrigui, A., & Zrigui, M. (2018). A text semantic similarity approach for Arabic paraphrase detection. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 10762 LNCS, pp. 338–349). Springer Verlag. https://doi.org/10.1007/978-3-319-77116-8_25

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free