Plagiarism by the copy and paste of documents written by other authors has recently become a large problem as electronic documents have increased. In higher educational institutions, it is also of great concern in student reports. In this paper, we have proposed a novel approach to automatically detect plagiarism, especially for student experimental reports in Japanese and focusing on the common nouns and the structure of parts of speech for each sentence. We have also performed experiments to evaluate our approach with actual Japanese experimental reports written by our students with the measures such as precision, recall and F-value. As the experimental results, our proposed approach has succeeded to detect plagiarized pairs of sentences within high accuracy. In addition, we also discuss the parts where our proposed approach miss-detected and couldn’t detect.
CITATION STYLE
Yokoi, T. (2015). Sentence-based plagiarism detection for Japanese document based on common nouns and part-of-speech structure. In Communications in Computer and Information Science (Vol. 513, pp. 297–308). Springer Verlag. https://doi.org/10.1007/978-3-319-17530-0_21
Mendeley helps you to discover research relevant for your work.