The evaluation of sentence similarity measures

Palakorn Achananuparp; Xiaohua Hu; Xiajiong Shen

Conference Proceedings

The evaluation of sentence similarity measures

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2008) 5182 LNCS 305-316

DOI: 10.1007/978-3-540-85836-2_29

174Citations

258Readers

Get full text

Abstract

The ability to accurately judge the similarity between natural language sentences is critical to the performance of several applications such as text mining, question answering, and text summarization. Given two sentences, an effective similarity measure should be able to determine whether the sentences are semantically equivalent or not, taking into account the variability of natural language expression. That is, the correct similarity judgment should be made even if the sentences do not share similar surface form. In this work, we evaluate fourteen existing text similarity measures which have been used to calculate similarity score between sentences in many text applications. The evaluation is conducted on three different data sets, TREC9 question variants, Microsoft Research paraphrase corpus, and the third recognizing textual entailment data set. © 2008 Springer-Verlag Berlin Heidelberg.

Author supplied keywords

Cite

CITATION STYLE

APA

Achananuparp, P., Hu, X., & Shen, X. (2008). The evaluation of sentence similarity measures. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 5182 LNCS, pp. 305–316). https://doi.org/10.1007/978-3-540-85836-2_29

The evaluation of sentence similarity measures

Abstract

Author supplied keywords

Cite

Register to see more suggestions