Sentence-based plagiarism detection for Japanese document based on common nouns and part-of-speech structure

Takeru Yokoi

Conference Proceedings

Sentence-based plagiarism detection for Japanese document based on common nouns and part-of-speech structure

Yokoi T

Communications in Computer and Information Science (2015) 513 297-308

DOI: 10.1007/978-3-319-17530-0_21

2Citations

12Readers

Get full text

Abstract

Plagiarism by the copy and paste of documents written by other authors has recently become a large problem as electronic documents have increased. In higher educational institutions, it is also of great concern in student reports. In this paper, we have proposed a novel approach to automatically detect plagiarism, especially for student experimental reports in Japanese and focusing on the common nouns and the structure of parts of speech for each sentence. We have also performed experiments to evaluate our approach with actual Japanese experimental reports written by our students with the measures such as precision, recall and F-value. As the experimental results, our proposed approach has succeeded to detect plagiarized pairs of sentences within high accuracy. In addition, we also discuss the parts where our proposed approach miss-detected and couldn’t detect.

Author supplied keywords

Cite

CITATION STYLE

APA

Yokoi, T. (2015). Sentence-based plagiarism detection for Japanese document based on common nouns and part-of-speech structure. In Communications in Computer and Information Science (Vol. 513, pp. 297–308). Springer Verlag. https://doi.org/10.1007/978-3-319-17530-0_21

Sentence-based plagiarism detection for Japanese document based on common nouns and part-of-speech structure

Abstract

Author supplied keywords

Cite

Register to see more suggestions