Retrieval of documents is used for finding relevant documents to user queries and plagiarism is the act of copying the contents of one's work without any acknowledgement. Paraphrasing is a type of plagiarism where the contents from source may be changed. This paper proposes a new document retrieval system and paraphrase plagiarism detection of text documents using multi-layered self organizing map (MLSOM). In the proposed system tree structure is extracted for the document that hierarchically represents the document features as document, pages and paragraphs. To handle the tree-structured documents in an efficient way, MLSOM is used as a clustering algorithm. Using MLSOM the documents can be compared for detecting plagiarism and it finds out the local similarity. Paraphrased plagiarism can be detected by finding the similarity between sentences of two documents which is a kind of local similarity detection. © 2011 Springer-Verlag.
CITATION STYLE
Sandhya, S., & Chitrakala, S. (2011). Plagiarism detection of paraphrases in text documents with document retrieval. In Communications in Computer and Information Science (Vol. 198 CCIS, pp. 330–338). https://doi.org/10.1007/978-3-642-22555-0_34
Mendeley helps you to discover research relevant for your work.