An improved model selection heuristic for AUC

Shaomin Wu; Peter Flach; Cèsar Ferri

Conference ProceedingsOPEN ACCESS

An improved model selection heuristic for AUC

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2007) 4701 LNAI 478-489

DOI: 10.1007/978-3-540-74958-5_44

29Citations

28Readers

Abstract

The area under the ROC curve (AUC) has been widely used to measure ranking performance for binary classification tasks. AUC only employs the classifier's scores to rank the test instances; thus, it ignores other valuable information conveyed by the scores, such as sensitivity to small differences in the score values. However, as such differences are inevitable across samples, ignoring them may lead to overfitting the validation set when selecting models with high AUC. This problem is tackled in this paper. On the basis of ranks as well as scores, we introduce a new metric called scored AUC (sAUC), which is the area under the sROC curve. The latter measures how quickly AUC deteriorates if positive scores are decreased. We study the interpretation and statistical properties of sAUC. Experimental results on UCI data sets convincingly demonstrate the effectiveness of the new metric for classifier evaluation and selection in the case of limited validation data. © Springer-Verlag Berlin Heidelberg 2007.

Cite

CITATION STYLE

APA

Wu, S., Flach, P., & Ferri, C. (2007). An improved model selection heuristic for AUC. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 4701 LNAI, pp. 478–489). Springer Verlag. https://doi.org/10.1007/978-3-540-74958-5_44

An improved model selection heuristic for AUC

Abstract

Cite

Register to see more suggestions