Document retrieval systems have been restricted, by the nature of the task, to techniques that can be used with large numbers of documents and broad domains. The most effective techniques that have been developed are based on the statistics of word occurrences in text. In this paper, we describe an approacli to using natural language processing (NLP) techniques for what is essentially a natural language problem-the comparison of a request text witli the text of document titles and abstracts. The proposed NLP techniques are used to develop a request model based on "conceptual case frames" and to compare this model with the texts of candidate documents. The request model is also used to provide information to statistical search techniques that identify the candidate documents. As part of a preliminary evaluation of this approach, case frame representations of a set of requests from the CACM collection were constructed. Statistical searches carried out using dependency and relative importance information derived from the request models indicate that performance benefits can be obtained.
CITATION STYLE
Croft, W. B., & Lewis, D. D. (1987). An approach to natural language processing for document retrieval. In Proceedings of the 10th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, SIGIR 1987 (pp. 26–32). Association for Computing Machinery, Inc. https://doi.org/10.1145/42005.42009
Mendeley helps you to discover research relevant for your work.