How to evaluate an agent's behavior to infrequent events? - Reliable performance estimation insensitive to class distribution

Abstract

In everyday life, humans and animals often have to base decisions on infrequent relevant stimuli that occur among frequent irrelevant ones. When research in neuroscience mimics this situation, the effect of this imbalance in stimulus classes on performance evaluation has to be considered. This is most obvious for the often-used overall accuracy, because the proportion of correct responses is governed by the more frequent class. This imbalance problem has been widely debated across disciplines, and of the treatments discussed, this review focuses on performance estimation. For this, a more universal view is taken: an agent performing a classification task. Commonly used performance measures are characterized when used with imbalanced classes. Metrics like Accuracy, F-Measure, Matthews Correlation Coefficient, and Mutual Information are affected by imbalance, while other metrics, like AUC, d-prime, Balanced Accuracy, Weighted Accuracy and G-Mean, do not have this drawback. It is pointed out that one is not restricted to this group of metrics, but the sensitivity to the class ratio has to be kept in mind for a proper choice. Selecting an appropriate metric is critical to avoid drawing misleading conclusions. © 2014 Straube and Krell.
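
The contrast the abstract draws between imbalance-sensitive and imbalance-insensitive metrics can be illustrated with a minimal Python sketch. It is not taken from the article: the function names `metrics` and `expected_counts`, the hit rates (0.8 for the rare class, 0.9 for the frequent class), and the class sizes are assumptions chosen for illustration, and the formulas are the standard textbook definitions of each metric.

```python
from math import sqrt


def metrics(tp, fn, tn, fp):
    """Compute four common metrics from confusion-matrix counts."""
    tpr = tp / (tp + fn)  # true positive rate (sensitivity)
    tnr = tn / (tn + fp)  # true negative rate (specificity)
    return {
        "accuracy": (tp + tn) / (tp + fn + tn + fp),
        "f_measure": 2 * tp / (2 * tp + fp + fn),
        "balanced_accuracy": (tpr + tnr) / 2,
        "g_mean": sqrt(tpr * tnr),
    }


def expected_counts(n_pos, n_neg, tpr, tnr):
    """Expected (tp, fn, tn, fp) for an agent with fixed per-class rates.

    Hypothetical helper: the agent's behavior per class is held constant,
    only the number of examples per class changes.
    """
    tp = tpr * n_pos
    tn = tnr * n_neg
    return tp, n_pos - tp, tn, n_neg - tn


# Same agent (assumed TPR = 0.8, TNR = 0.9), two class ratios.
balanced = metrics(*expected_counts(500, 500, 0.8, 0.9))
imbalanced = metrics(*expected_counts(50, 950, 0.8, 0.9))

# An agent that never reports the rare class (TPR = 0, TNR = 1).
always_negative = metrics(*expected_counts(50, 950, 0.0, 1.0))

for name, result in [("1:1", balanced), ("1:19", imbalanced),
                     ("always negative, 1:19", always_negative)]:
    print(name, {k: round(v, 3) for k, v in result.items()})
```

With these assumed numbers, the agent's accuracy rises from 0.85 to 0.895 and its F-measure falls from about 0.84 to about 0.43 when the class ratio changes from 1:1 to 1:19, although its behavior per class is identical. Balanced accuracy and G-mean do not change. The agent that ignores the rare class entirely reaches 0.95 accuracy but only 0.5 balanced accuracy.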

Cite

Straube, D., & Krell, M. M. (2014, April 10). How to evaluate an agent’s behavior to infrequent events? - Reliable performance estimation insensitive to class distribution. Frontiers in Computational Neuroscience. Frontiers Research Foundation. https://doi.org/10.3389/fncom.2014.00043
