Abstract
Evaluation is a major force in research, development, and applications related to information retrieval (IR). This paper is a critical and historical analysis of evaluations of IR systems and processes. Strengths and shortcomings of evaluation efforts and approaches are discussed, together with major challenges and questions. A limited comparison is made with evaluation in expert systems and Online Public Access Catalogs (OPACs). Evaluation is further analyzed in relation to the broad context and the specific problems addressed. Levels of evaluation are identified and contrasted; most IR evaluations have been concerned with the processing level, but others have been conducted at the output, users and use, and social levels. A major problem is the isolation of evaluations at a given level. Issues related to the systems under evaluation and to evaluation criteria, measures, measuring instruments, and methodologies are examined. A general point is also considered: IR is increasingly embedded in many other applications, such as the Internet and digital libraries, yet little evaluation in the traditional IR sense is undertaken in relation to these applications. The challenges are to integrate IR evaluations from different levels and to incorporate evaluation into new applications.
Citation
Saracevic, T. (1995). Evaluation of evaluation in information retrieval. In SIGIR Forum (ACM Special Interest Group on Information Retrieval) (pp. 137–146). ACM. https://doi.org/10.1145/215206.215351