Using Sentence Similarity Measure for Plagiarism Detection of Arabic Documents

Wafa Wali; Bilel Gargouri; Abdelmajid Ben Hamadou

Conference Proceedings

Using Sentence Similarity Measure for Plagiarism Detection of Arabic Documents

Advances in Intelligent Systems and Computing (2018) 736 52-62

DOI: 10.1007/978-3-319-76348-4_6

2Citations

6Readers

Get full text

Abstract

Plagiarism detection it is a challenging task, particularly in natural language texts. Some plagiarism detection tools have been developed for diverse natural languages, especially English. In this paper, we propose, a new plagiarism detection system devoted to Arabic text documents. This system is based on an algorithm that uses a semantic sentence similarity measure. Indeed, the sentence similarity measure aggregates in a linear function between three components: the lexical-based LS including the common words, the semantic-based SS using the synonymy relationships, and the syntactico-semantic- based SSS semantic arguments properties notably semantic argument and thematic role. It measures the semantic similarity between words that play the same syntactic role. Concerning the word-based semantic similarity, an information content-based measure is used to estimate the SS degree between words by exploiting the LMF Arabic standardized dictionary ElMadar. The performance of the proposed system was confirmed through experiments with student thesis reports that promising capabilities in identifying literal and some types of intelligent plagiarism. We also demonstrate its advantages over other plagiarism detection tools, including Aplag.

Author supplied keywords

Cite

CITATION STYLE

APA

Wali, W., Gargouri, B., & Ben Hamadou, A. (2018). Using Sentence Similarity Measure for Plagiarism Detection of Arabic Documents. In Advances in Intelligent Systems and Computing (Vol. 736, pp. 52–62). Springer Verlag. https://doi.org/10.1007/978-3-319-76348-4_6

Using Sentence Similarity Measure for Plagiarism Detection of Arabic Documents

Abstract

Author supplied keywords

Cite

Register to see more suggestions