Document Classification using Multiword Features

15Citations
Citations of this article
12Readers
Mendeley users who have this article in their library.

Abstract

We investigate the use of multiword query features to improve the effectiveness of text-retrieval systems that accept natural-language queries. A relevance feedback process is explained that expands an initial query with single and multiword features. The multiword features are modelled as a set of words appearing within windows of varying sizes. Our experimental results suggest that windows of larger span yield improvements in retrieval over windows of smaller span. This result gives rise to a query contraction process that prunes 25% of the features in an expanded query with no loss in retrieval effectiveness.

Cite

CITATION STYLE

APA

Papka, R., & Allan, J. (1998). Document Classification using Multiword Features. In International Conference on Information and Knowledge Management, Proceedings (Vol. 1998-January, pp. 124–131). Association for Computing Machinery. https://doi.org/10.1145/288627.288648

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free