Optimizing classifiers for hypothetical scenarios

Abstract

The deployment of classification models is an integral component of many modern data mining and machine learning applications. A typical classification model is built with the tacit assumption that the deployment scenario by which it is evaluated is fixed and fully characterized. Yet, in the practical deployment of classification methods, important aspects of the application environment, such as the misclassification costs, may be uncertain during model building. Moreover, a single classification model may be applied in several different deployment scenarios. In this work, we propose a method to optimize a model for uncertain deployment scenarios. We begin by deriving a relationship between two evaluation measures, the H measure and cost curves, that may be used to address uncertainty in classifier performance. We show that when uncertainty in classifier performance is modeled as a probabilistic belief that is a function of this underlying relationship, a natural definition of risk emerges for both classifiers and instances. We then leverage this notion of risk to develop a boosting-based algorithm, which we call RiskBoost, that directly mitigates classifier risk, and we demonstrate that it outperforms AdaBoost on a diverse selection of datasets.
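
As a point of reference for the comparison described above, the sketch below shows a minimal implementation of the standard (discrete) AdaBoost baseline named in the abstract, built on scikit-learn decision stumps. The paper's RiskBoost replaces the instance re-weighting step with one driven by its risk notion derived from the H measure / cost-curve relationship; that substitution is not reproduced here, and all function and variable names are illustrative rather than taken from the paper.

```python
# Minimal AdaBoost sketch (baseline named in the abstract), not RiskBoost.
# Assumes X is a 2-D numpy array of features and y a vector of labels in {-1, +1}.
import numpy as np
from sklearn.tree import DecisionTreeClassifier


def adaboost_fit(X, y, n_rounds=50):
    """Standard discrete AdaBoost with depth-1 decision-tree weak learners."""
    n = len(y)
    w = np.full(n, 1.0 / n)          # instance weights, start uniform
    learners, alphas = [], []

    for _ in range(n_rounds):
        stump = DecisionTreeClassifier(max_depth=1)
        stump.fit(X, y, sample_weight=w)
        pred = stump.predict(X)

        err = np.sum(w * (pred != y)) / np.sum(w)
        if err >= 0.5:               # weak learner no better than chance
            break
        err = max(err, 1e-12)
        alpha = 0.5 * np.log((1.0 - err) / err)

        # AdaBoost's exponential re-weighting of misclassified instances.
        # Per the abstract, RiskBoost instead drives this update with an
        # instance-level risk; that step is only marked here, not shown.
        w *= np.exp(-alpha * y * pred)
        w /= np.sum(w)

        learners.append(stump)
        alphas.append(alpha)

    return learners, alphas


def adaboost_predict(learners, alphas, X):
    """Weighted-vote prediction of the boosted ensemble."""
    score = sum(a * clf.predict(X) for a, clf in zip(alphas, learners))
    return np.sign(score)
```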

Citation (APA)

Johnson, R. A., Raeder, T., & Chawla, N. V. (2015). Optimizing classifiers for hypothetical scenarios. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 9077, pp. 264–276). Springer Verlag. https://doi.org/10.1007/978-3-319-18038-0_21
