Abstract
Statistical language modeling (SLM) has been used in many different domains for decades and has also been applied to information retrieval (IR) recently. Documents retrieved using this approach are ranked according their probability of generating the given query. In this paper, we present a novel approach that employs the generalized Expectation Maximization (EM) algorithm to improve language models by representing their parameters as observation probabilities of Hidden Markov Models (HMM). In the experiments, we demonstrate that our method outperforms standard SLM-based and tf.idf-based methods on TREC 2005 HARD Track data. © 2009 ACL and AFNLP.
Cite
CITATION STYLE
Chiu, J. L. T., & Huang, J. W. (2009). Optimizing language model information retrieval system with Expectation Maximization algorithm. In ACL-IJCNLP 2009 - Joint Conf. of the 47th Annual Meeting of the Association for Computational Linguistics and 4th Int. Joint Conf. on Natural Language Processing of the AFNLP, Proceedings of the Conf. (pp. 63–71). https://doi.org/10.3115/1667884.1667894
Register to see more suggestions
Mendeley helps you to discover research relevant for your work.