Using Sentence Similarity Measure for Plagiarism Detection of Arabic Documents

Abstract

Plagiarism detection is a challenging task, particularly in natural language texts. Plagiarism detection tools have been developed for various natural languages, especially English. In this paper, we propose a new plagiarism detection system devoted to Arabic text documents. The system is based on an algorithm that uses a semantic sentence similarity measure. This measure aggregates three components in a linear function: the lexical-based similarity LS, which covers the common words; the semantic-based similarity SS, which uses synonymy relationships; and the syntactico-semantic-based similarity SSS, which uses properties of semantic arguments, notably semantic argument and thematic role. This last component measures the semantic similarity between words that play the same syntactic role. For word-level semantic similarity, an information content-based measure estimates the SS degree between words by exploiting ElMadar, an Arabic dictionary standardized according to LMF. The performance of the proposed system was confirmed through experiments on student thesis reports, which showed promising capabilities in identifying literal plagiarism and some types of intelligent plagiarism. We also demonstrate its advantages over other plagiarism detection tools, including Aplag.
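The linear aggregation described in the abstract can be illustrated with a minimal sketch. The abstract does not give the weights or the exact definition of each component, so everything here is an assumption made for illustration: the function names `lexical_similarity` and `sentence_similarity` are hypothetical, the weights (0.4, 0.3, 0.3) are placeholders, and Jaccard overlap of word sets stands in for the LS component. The SS and SSS scores are passed in as plain numbers because computing them would require the ElMadar dictionary and a semantic argument annotation.

```python
def lexical_similarity(sentence_a, sentence_b):
    """Jaccard overlap of word sets: an assumed stand-in for the LS component."""
    words_a, words_b = set(sentence_a.split()), set(sentence_b.split())
    if not words_a and not words_b:
        return 0.0
    return len(words_a & words_b) / len(words_a | words_b)


def sentence_similarity(ls, ss, sss, weights=(0.4, 0.3, 0.3)):
    """Linear aggregation of three component scores, each expected in [0, 1].

    The default weights are illustrative assumptions, not values from the paper.
    """
    alpha, beta, gamma = weights
    if abs(alpha + beta + gamma - 1.0) > 1e-9:
        raise ValueError("weights must sum to 1")
    return alpha * ls + beta * ss + gamma * sss


# Usage: LS is computed from the sentences; SS and SSS are made-up scores.
ls = lexical_similarity("the student wrote the report",
                        "the student copied the report")
score = sentence_similarity(ls, ss=0.8, sss=0.5)
```

Constraining the weights to sum to 1 keeps the combined score in the same [0, 1] range as its components, which makes it straightforward to compare against a single plagiarism threshold.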

Cite

Wali, W., Gargouri, B., & Ben Hamadou, A. (2018). Using Sentence Similarity Measure for Plagiarism Detection of Arabic Documents. In Advances in Intelligent Systems and Computing (Vol. 736, pp. 52–62). Springer Verlag. https://doi.org/10.1007/978-3-319-76348-4_6
