Improving Performance of English-Hindi CLIR System using Linguistic Tools and Techniques

  • Seetha A
  • Das S
  • Kumar M
N/ACitations
Citations of this article
1Readers
Mendeley users who have this article in their library.
Get full text

Abstract

World Wide Web is growing rapidly and the content on Web of languages other than English is also increasing rapidly compared to English. Hindi is most widely spoken language in India. In past few years Hindi content has also increased rapidly on the Web. To ensure complete information exchange, in the era of globalization the information retrieval systems need to be multilingual or cross lingual. We have designed and developed an English-Hindi Cross Language Information Retrieval (CLIR) System using Dictionary based query translation method. Our previous experiments [5] showed reasonable 64.80% performance of the monolingual retrieval with this system using the TREC style test collection created especially for this research. This paper describes results of the English-Hindi CLIR experiments using some specialized query formulation strategies like stopword removal, stemming of query terms, transliteration of out of vocabulary words etc. The results demonstrated that the performance gradually improved when we applied NLP tools and techniques in short queries. Performance was dropped down to some extent when using query expansion and structuring as well using long queries to obtained cross-language results. The best performance result we obtained from these experiments was 82.91% compared to the monolingual retrieval.

Cite

CITATION STYLE

APA

Seetha, A., Das, S., & Kumar, M. (2009). Improving Performance of English-Hindi CLIR System using Linguistic Tools and Techniques. In Proceedings of the First International Conference on Intelligent Human Computer Interaction (pp. 261–271). Springer India. https://doi.org/10.1007/978-81-8489-203-1_26

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free