Using grammar-profiles to intrinsically expose plagiarism in text documents

Abstract

Intrinsic plagiarism detection is the task of finding plagiarized sections in a text document without using a reference corpus. This paper describes a novel approach in this field that analyzes the grammar used by authors and applies sliding windows to find significant differences in writing style. To find suspicious passages, the algorithm splits a document into single sentences, computes a syntax (grammar) tree for each sentence, and builds profiles from frequently used grammar patterns. The text is then traversed with a sliding window, and each window is compared to the document profile using a distance metric. Finally, all sentences whose distance is significantly higher, as judged against a Gaussian (normal) distribution, are marked as suspicious. A preliminary evaluation of the algorithm shows very promising results. © 2013 Springer-Verlag Berlin Heidelberg.
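
The pipeline described in the abstract can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the authors' implementation: each sentence is assumed to be already parsed and reduced to a list of grammar-pattern labels (the labels, the L1 distance, the window size, and the threshold factor `k` are all hypothetical choices made here), and "significantly higher" is approximated as exceeding the mean window distance by `k` standard deviations.

```python
from collections import Counter
from statistics import mean, pstdev


def profile(sentences):
    """Relative frequency of each grammar pattern over a list of sentences."""
    counts = Counter(p for s in sentences for p in s)
    total = sum(counts.values())
    return {p: c / total for p, c in counts.items()} if total else {}


def distance(a, b):
    """L1 distance between two profiles (an assumed metric, for illustration)."""
    return sum(abs(a.get(k, 0.0) - b.get(k, 0.0)) for k in set(a) | set(b))


def suspicious_sentences(sentences, window=3, k=2.0):
    """Indices of sentences in windows whose distance exceeds mean + k * std."""
    doc = profile(sentences)
    # One window per start position; at least one window for short documents.
    starts = range(max(1, len(sentences) - window + 1))
    dists = [distance(profile(sentences[i:i + window]), doc) for i in starts]
    mu, sigma = mean(dists), pstdev(dists)
    flagged = set()
    for i, d in zip(starts, dists):
        # With sigma == 0 all windows look alike, so nothing is flagged.
        if sigma > 0 and d > mu + k * sigma:
            flagged.update(range(i, min(i + window, len(sentences))))
    return sorted(flagged)


# Hypothetical input: 20 stylistically uniform sentences plus one outlier
# at index 10 that uses entirely different grammar patterns.
example = [["NP-VP", "DT-NN"] for _ in range(21)]
example[10] = ["SBAR-S", "ADVP-RB"]

print(suspicious_sentences(example, window=1))  # -> [10]
```

A larger window smooths the per-sentence noise but also marks the neighbors of an outlier, since every sentence inside a flagged window is reported.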

Citation (APA)

Tschuggnall, M., & Specht, G. (2013). Using grammar-profiles to intrinsically expose plagiarism in text documents. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 7934 LNCS, pp. 297–302). https://doi.org/10.1007/978-3-642-38824-8_28
