A reflective view on text similarity

ISSN: 13138502
27Citations
Citations of this article
112Readers
Mendeley users who have this article in their library.

Abstract

While the concept of similarity is well grounded in psychology, text similarity is less well-defined. Thus, we analyze text similarity with respect to its definition and the datasets used for evaluation. We formalize text similarity based on the geometric model of conceptual spaces along three dimensions inherent to texts: structure, style, and content. We empirically ground these dimensions in a set of annotation studies, and categorize applications according to these dimensions. Furthermore, we analyze the characteristics of the existing evaluation datasets, and use those datasets to assess the performance of common text similarity measures.

Cite

CITATION STYLE

APA

Bär, D., Zesch, T., & Gurevych, I. (2011). A reflective view on text similarity. In International Conference Recent Advances in Natural Language Processing, RANLP (pp. 515–520). Incoma Ltd.

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free