Performance comparison of language models for information retrieval

Shuaixiang Dai; Qian Diao; Changle Zhou

Conference ProceedingsOPEN ACCESS

Performance comparison of language models for information retrieval

IFIP Advances in Information and Communication Technology (2005) 187 721-730

DOI: 10.1007/0-387-29295-0_78

1Citations

9Readers

Abstract

Vector Space Model (VSM), Statistical Language Model (SLM) and Inference Network are three distinguished language models. Instead of evaluating their performance directly, we estimate the information strategies founded on them using the known measures: precision and recall. What's more, we proposed the Sort Order Rationality (SOR) to make further performance comparison among different language models. All models are tested on a standard testing collection. Three important conclusions are attained: (1). The IR model combining the statistical language modeling and inference network approaches is better than that only founded on statistical language modeling approach. What's more, it is also better than that based on vector space modeling approach. (2). The performance of IR model based on VSM is similar to that based on SLM. (3). The Dirichlet priors method often is a better option to smooth a statistical language model. In some respects, these conclusions provide some experimental bases for constructing an efficient information retrieval system. © 2005 by International Federation for Information Processing.

Author supplied keywords

Cite

CITATION STYLE

APA

Dai, S., Diao, Q., & Zhou, C. (2005). Performance comparison of language models for information retrieval. In IFIP Advances in Information and Communication Technology (Vol. 187, pp. 721–730). Springer New York LLC. https://doi.org/10.1007/0-387-29295-0_78

Performance comparison of language models for information retrieval

Abstract

Author supplied keywords

Cite

Register to see more suggestions