A reflective view on text similarity

Daniel Bär; Torsten Zesch; Iryna Gurevych

Conference Proceedings

A reflective view on text similarity

International Conference Recent Advances in Natural Language Processing, RANLP (2011) 515-520

ISSN: 13138502

27Citations

112Readers

Abstract

While the concept of similarity is well grounded in psychology, text similarity is less well-defined. Thus, we analyze text similarity with respect to its definition and the datasets used for evaluation. We formalize text similarity based on the geometric model of conceptual spaces along three dimensions inherent to texts: structure, style, and content. We empirically ground these dimensions in a set of annotation studies, and categorize applications according to these dimensions. Furthermore, we analyze the characteristics of the existing evaluation datasets, and use those datasets to assess the performance of common text similarity measures.

Cite

CITATION STYLE

APA

Bär, D., Zesch, T., & Gurevych, I. (2011). A reflective view on text similarity. In International Conference Recent Advances in Natural Language Processing, RANLP (pp. 515–520). Incoma Ltd.

A reflective view on text similarity

Abstract

Cite

Register to see more suggestions