Reference standards, judges, and comparison subjects: Roles for experts in evaluating system performance

George Hripcsak; Adam Wilcox

ArticleOPEN ACCESS

Reference standards, judges, and comparison subjects: Roles for experts in evaluating system performance

Journal of the American Medical Informatics Association

DOI: 10.1136/jamia.2002.0090001

65Citations

36Readers

Get full text

Abstract

Medical informatics systems are often designed to perform at the level of human experts. Evaluation of the performance of these systems is often constrained by lack of reference standards, either because the appropriate response is not known or because no simple appropriate response exists. Even when performance can be assessed, it is not always clear whether the performance is sufficient or reasonable. These challenges can be addressed if an evaluator enlists the help of clinical domain experts. 1) The experts can carry out the same tasks as the system, and then their responses can be combined to generate a reference standard. 2) The experts can judge the appropriateness of system output directly. 3) The experts can serve as comparison subjects with which the system can be compared. These are separate roles that have different implications for study design, metrics, and issues of reliability and validity. Diagrams help delineate the roles of experts in complex study designs.

Cite

CITATION STYLE

APA

Hripcsak, G., & Wilcox, A. (2002). Reference standards, judges, and comparison subjects: Roles for experts in evaluating system performance. Journal of the American Medical Informatics Association. Hanley and Belfus Inc. https://doi.org/10.1136/jamia.2002.0090001

Reference standards, judges, and comparison subjects: Roles for experts in evaluating system performance

Abstract

Cite

Register to see more suggestions