The receiver operating characteristic (ROC) analysis, the sampling variability can result in uncertainties of performance measures. Thus, while evaluating and comparing the performances of algorithms, the measurement uncertainties must be taken into account. The key issue is how to calculate the uncertainties of performance measures in ROC analysis. Our ultimate goal is to perform the significance test in evaluation and comparison using the standard errors computed. From the operational perspective, based on fingerprint-image matching algorithms on large datasets, the measures and their uncertainties are investigated in the three scenarios: 1) the true accept rate (TAR) of genuine scores at a specified false accept rate (FAR) of impostor scores, 2) the TAR and FAR at a given threshold, and 3) the equal error rate. The uncertainties of measures are calculated using the nonparametric two-sample bootstrap based on our extensive studies of bootstrap variabilityon large datasets. The significance test is carried out to determine whether the difference between the performance of onealgorithm and a hypothesized value, or the difference between the performances of two algorithms where the correlation is taken into account is statistically significant. Examples are provided.
CITATION STYLE
Wu, C. J., Martin, A. F., & Kacker, R. N. (2011). Measures, uncertainties, and significance test in operational roc analysis. Journal of Research of the National Institute of Standards and Technology, 116(1), 517–537. https://doi.org/10.6028/jres.116.003
Mendeley helps you to discover research relevant for your work.