Document similarity search based on manifold-ranking of TextTiles

6Citations
Citations of this article
3Readers
Mendeley users who have this article in their library.
Get full text

Abstract

Document similarity search aims to find documents similar to a query document in a text corpus and return a ranked list of similar documents. Most existing approaches to document similarity search compute similarity scores between the query and the documents based on a retrieval function (e.g. Cosine) and then rank the documents by their similarity scores. In this paper, we proposed a novel retrieval approach based on manifold-ranking of TextTiles to re-rank the initially retrieved documents. The proposed approach can make full use of the intrinsic global manifold structure for the TextTiles of the documents in the re-ranking process. Experimental results demonstrate that the proposed approach can significantly improve the retrieval performances based on different retrieval functions. TextTile is validated to be a better unit than the whole document in the manifold-ranking process. © Springer-Verlag Berlin Heidelberg 2006.

Cite

CITATION STYLE

APA

Wan, X., Yang, J., & Xiao, J. (2006). Document similarity search based on manifold-ranking of TextTiles. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 4182 LNCS, pp. 14–25). Springer Verlag. https://doi.org/10.1007/11880592_2

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free