Word particles applied to information retrieval

2Citations
Citations of this article
3Readers
Mendeley users who have this article in their library.
Get full text

Abstract

Document retrieval systems conventionally use words as the basic unit of representation, a natural choice since words are primary carriers of semantic information. In this paper we propose the use of a different, phonetically defined unit of representation that we call "particles". Particles are phonetic sequences that do not possess meaning. Both documents and queries are converted from their standard wordbased form into sequences of particles. Indexing and retrieval is performed with particles. Experiments show that this scheme is capable of achieving retrieval performance that is comparable to that from words when the text in the documents and queries are clean, and can result in significantly improved retrieval when they are noisy. © Springer-Verlag Berlin Heidelberg 2009.

Cite

CITATION STYLE

APA

Gouv̂ea, E. B., & Raj, B. (2009). Word particles applied to information retrieval. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 5478 LNCS, pp. 449–460). https://doi.org/10.1007/978-3-642-00958-7_40

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free