A text semantic similarity approach for Arabic paraphrase detection

Adnen Mahmoud; Ahmed Zrigui; Mounir Zrigui

Conference Proceedings

A text semantic similarity approach for Arabic paraphrase detection

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2018) 10762 LNCS 338-349

DOI: 10.1007/978-3-319-77116-8_25

18Citations

19Readers

Get full text

Abstract

The main challenge of paraphrase is how to detect the semantic relationship between the suspect text document and the source text document. Nowadays, the combination of Natural Language Processing NLP and deep learning based approaches have a booming in the field of text analysis, including: text classification, machine translation, text similarity detection, etc. In this context, we proposed a deep learning based method to detect Arabic paraphrase composed by the following phases: First, we started with a preprocessing phase by extracting the relevant information from text document. Then, word2vec algorithm was used to generate word vectors representation which they would be combined subsequently to generate a sentence vectors representation. Finally, we used a Convolutional Neural Network CNN to improve the ability to capture statistical regularities in the context of sentences which then makes it possible to facilitate the similarity measurement operation between the representations of source and suspicious sentences. The evaluation of our proposed approach gave us a promising result in term of precision.

Author supplied keywords

Cite

CITATION STYLE

APA

Mahmoud, A., Zrigui, A., & Zrigui, M. (2018). A text semantic similarity approach for Arabic paraphrase detection. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 10762 LNCS, pp. 338–349). Springer Verlag. https://doi.org/10.1007/978-3-319-77116-8_25

A text semantic similarity approach for Arabic paraphrase detection

Abstract

Author supplied keywords

Cite

Register to see more suggestions