Similarity measures for tracking information flow

Abstract

Text similarity spans a spectrum, with broad topical similarity near one extreme and document identity at the other. Intermediate levels of similarity, resulting from summarization, paraphrasing, copying, and stronger forms of topical relevance, are useful for applications such as information flow analysis and question-answering tasks. In this paper, we explore mechanisms for measuring such intermediate kinds of similarity, focusing on the task of identifying where a particular piece of information originated. We consider both sentence-to-sentence and document-to-document comparison, and have incorporated these algorithms into RECAP, a prototype information flow analysis tool. Our experimental results with RECAP indicate that new mechanisms such as those we propose are likely to be more appropriate than existing methods for identifying the intermediate forms of similarity. Copyright 2005 ACM.

Cite

Metzler, D., Bernstein, Y., Croft, W. B., Moffat, A., & Zobel, J. (2005). Similarity measures for tracking information flow. In International Conference on Information and Knowledge Management, Proceedings (pp. 517–524). https://doi.org/10.1145/1099554.1099695
