An empirical evaluation of ranking measures with respect to robustness to noise

6Citations
Citations of this article
15Readers
Mendeley users who have this article in their library.

Abstract

Ranking measures play an important role in model evaluation and selection. Using both synthetic and real-world data sets, we investigate how different types and levels of noise affect the area under the ROC curve (AUC), the area under the ROC convex hull, the scored AUC, the Kolmogorov-Smirnov statistic, and the H-measure. In our experiments, the AUC was, overall, the most robust among these measures, thereby reinvigorating it as a reliable metric despite its well-known deficiencies. This paper also introduces a novel ranking measure, which is remarkably robust to noise yet conceptually simple. © 2014 AI Access Foundation.

Cite

CITATION STYLE

APA

Berrar, D. (2014). An empirical evaluation of ranking measures with respect to robustness to noise. Journal of Artificial Intelligence Research, 49, 241–267. https://doi.org/10.1613/jair.4136

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free