Abstract
Due to the rapid increase of internet-based data, there is urgent need for a robust intelligent documents security mechanism. Although there are many attempts to build a plagiarism detection system in natural language documents, the unlimited variation and different writing styles of each character in Arabic documents make building such systems challenging. Based on its position in a word, the same Arabic letter can be written three different ways, which makes the handwritten character recognition a cumbersome process. This article proposes an intelligent unsupervised model to detect plagiarism in these documents called ASTAP. First, a handwritten Arabic character recognition system is proposed using the Grey Wolf Optimization (GWO) algorithm. Then, a modified Abstract Syntax Tree (AST) is used to match the contents of the Arabic documents to detect any similarity. Compared to the state-of-the-art methods, ASTAP improves the effectiveness of the plagiarism detection in terms of the matched similarity ratio, the precision ratio, and the processing time.
Author supplied keywords
Cite
CITATION STYLE
Zaher, M., Shehab, A., Elhoseny, M., & Farahat, F. F. (2020). Unsupervised model for detecting plagiarism in internet-based handwritten Arabic documents. Journal of Organizational and End User Computing, 32(2), 42–66. https://doi.org/10.4018/JOEUC.2020040103
Register to see more suggestions
Mendeley helps you to discover research relevant for your work.