On automatic plagiarism detection based on n-grams comparison

Alberto Barrón-Cedeño; Paolo Rosso

Conference Proceedings

On automatic plagiarism detection based on n-grams comparison

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2009) 5478 LNCS 696-700

DOI: 10.1007/978-3-642-00958-7_69

85Citations

75Readers

Get full text

Abstract

When automatic plagiarism detection is carried out considering a reference corpus, a suspicious text is compared to a set of original documents in order to relate the plagiarised text fragments to their potential source. One of the biggest difficulties in this task is to locateplagiarised fragments that have been modified (by rewording, insertion or deletion, for example) from the source text. The definition of proper text chunks as comparison units of the suspicious and original texts is crucial for the success of this kind of applications. Our experiments with the METER corpus show that the best results are obtained when considering low level word n-grams comparisons (n = {2, 3}). © Springer-Verlag Berlin Heidelberg 2009.

Author supplied keywords

Cite

CITATION STYLE

APA

Barrón-Cedeño, A., & Rosso, P. (2009). On automatic plagiarism detection based on n-grams comparison. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 5478 LNCS, pp. 696–700). https://doi.org/10.1007/978-3-642-00958-7_69

On automatic plagiarism detection based on n-grams comparison

Abstract

Author supplied keywords

Cite

Register to see more suggestions