Language model information retrieval with document expansion

Tao Tao; Xuanhui Wang; Qiaozhu Mei; Cheng Xiang Zhai

Conference ProceedingsOPEN ACCESS

Language model information retrieval with document expansion

HLT-NAACL 2006 - Human Language Technology Conference of the North American Chapter of the Association of Computational Linguistics, Proceedings of the Main Conference (2006) 407-414

DOI: 10.3115/1220835.1220887

108Citations

164Readers

Abstract

Language model information retrieval depends on accurate estimation of document models. In this paper, we propose a document expansion technique to deal with the problem of insufficient sampling of documents. We construct a probabilistic neighborhood for each document, and expand the document with its neighborhood information. The expanded document provides a more accurate estimation of the document model, thus improves retrieval accuracy. Moreover, since document expansion and pseudo feedback exploit different corpus structures, they can be combined to further improve performance. The experiment results on several different data sets demonstrate the effectiveness of the proposed document expansion method. © 2006 Association for Computational Linguistics.

Cite

CITATION STYLE

APA

Tao, T., Wang, X., Mei, Q., & Zhai, C. X. (2006). Language model information retrieval with document expansion. In HLT-NAACL 2006 - Human Language Technology Conference of the North American Chapter of the Association of Computational Linguistics, Proceedings of the Main Conference (pp. 407–414). Association for Computational Linguistics (ACL). https://doi.org/10.3115/1220835.1220887

Language model information retrieval with document expansion

Abstract

Cite

Register to see more suggestions