Citation matching in Sanskrit corpora using local alignment

Abhinandan S. Prasad; Shrisha Rao

Conference Proceedings

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), vol. 6465 LNAI, pp. 124-136 (2010)

DOI: 10.1007/978-3-642-17528-2_9

Abstract

Citation matching is the problem of determining which citation occurs in a given textual corpus. Most existing work on citation matching targets scientific literature. This paper presents methods for performing citation matching on Sanskrit texts. Two methods are considered: exact matching and approximate matching. Exact matching checks whether the citation occurs verbatim in the textual corpus. Approximate matching is a fuzzy string-matching method that computes a similarity score between each individual line of the textual corpus and the citation. The similarity score is calculated with the Smith-Waterman-Gotoh algorithm for local alignment, which is generally used in bioinformatics, and measures the closeness between the text and the citation. The exact- and approximate-matching methods are evaluated and compared. The methods presented can be easily applied to corpora in other Indic languages such as Kannada and Tamil. The approximate-matching method can in particular be used in compiling critical editions and in detecting plagiarism in literary works. © 2010 Springer-Verlag Berlin Heidelberg.
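The two methods described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the function names, the scoring parameters (match, mismatch, gap_open, gap_extend), the normalisation by the shorter string's length, and the 0.8 threshold are all assumptions chosen for illustration. The local alignment follows the general Smith-Waterman recurrence with Gotoh's affine gap penalties, computed over characters.

```python
def smith_waterman_gotoh(a, b, match=1.0, mismatch=-1.0,
                         gap_open=-1.0, gap_extend=-0.5):
    """Best local alignment score of a and b with affine gap penalties.

    Assumed convention: a gap of length k costs
    gap_open + (k - 1) * gap_extend. All parameter values are illustrative.
    """
    neg_inf = float("-inf")
    m = len(b)
    h_prev = [0.0] * (m + 1)      # best score ending at (i-1, j)
    f_prev = [neg_inf] * (m + 1)  # best score ending in a vertical gap
    best = 0.0
    for i in range(1, len(a) + 1):
        h_cur = [0.0] * (m + 1)
        f_cur = [neg_inf] * (m + 1)
        e = neg_inf               # best score ending in a horizontal gap
        for j in range(1, m + 1):
            # Open a new gap or extend an existing one, in each direction.
            e = max(h_cur[j - 1] + gap_open, e + gap_extend)
            f_cur[j] = max(h_prev[j] + gap_open, f_prev[j] + gap_extend)
            step = match if a[i - 1] == b[j - 1] else mismatch
            # Local alignment: a score never drops below zero.
            h_cur[j] = max(0.0, h_prev[j - 1] + step, e, f_cur[j])
            best = max(best, h_cur[j])
        h_prev, f_prev = h_cur, f_cur
    return best


def similarity(citation, line, match=1.0):
    """Alignment score scaled to [0, 1] by the shorter string's length.

    The normalisation is an assumption, not taken from the paper.
    """
    shorter = min(len(citation), len(line))
    if shorter == 0:
        return 0.0
    return smith_waterman_gotoh(citation, line, match=match) / (match * shorter)


def match_citation(citation, corpus_lines, threshold=0.8):
    """Return (line_index, kind, score) for every line matching the citation.

    Exact matching is a plain substring test; lines that fail it fall back
    to approximate matching against a hypothetical threshold.
    """
    results = []
    for index, line in enumerate(corpus_lines):
        if citation in line:
            results.append((index, "exact", 1.0))
        else:
            score = similarity(citation, line)
            if score >= threshold:
                results.append((index, "approximate", score))
    return results
```

Calling match_citation with a citation string and a list of corpus lines returns the lines that contain the citation verbatim, plus those whose local alignment score clears the threshold. A line that differs from the citation by a few dropped or substituted characters, as can happen with variant readings, would be caught only by the approximate path. Lowering the threshold trades precision for recall.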

Citation (APA)

Prasad, A. S., & Rao, S. (2010). Citation matching in Sanskrit corpora using local alignment. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 6465 LNAI, pp. 124–136). https://doi.org/10.1007/978-3-642-17528-2_9
