The problem of learning from imbalanced data sets, while not the same problem as learning when misclassification costs are unequal and unknown, can be handled in a similar manner. That is, in both contexts, we can use techniques from roc analysis to help with classifier design. We present results from two studies in which we dealt with skewed data sets and unequal, but unknown costs of error. We also compare for one domain these results to those obtained by over-sampling and under-sampling the data set. The operations of sampling, moving the decision threshold, and adjusting the cost matrix produced sets of classifiers that fell on the same roc curve. 1.
CITATION STYLE
Zahoransky, R. A., Allelein, H.-J., Bollin, E., Oehler, H., Schelling, U., & Schwarz, H. (2013). Kyoto-Protokoll. In Energietechnik (pp. 467–479). Springer Fachmedien Wiesbaden. https://doi.org/10.1007/978-3-8348-2279-6_20
Mendeley helps you to discover research relevant for your work.