Maximal exceptions with minimal descriptions

Abstract

We introduce a new approach to Exceptional Model Mining. Our algorithm, called EMDM, is an iterative method that alternates between Exception Maximisation and Description Minimisation. As a result, it finds maximally exceptional models with minimal descriptions. Exceptional Model Mining was recently introduced by Leman et al. (Exceptional model mining 1-16, 2008) as a generalisation of Subgroup Discovery. Instead of considering a single target attribute, it allows for multiple 'model' attributes on which models are fitted. If the model for a subgroup is substantially different from the model for the complete database, it is regarded as an exceptional model. To measure exceptionality, we propose two information-theoretic measures. One is based on the Kullback-Leibler divergence, the other on Krimp. We show how compression can be used for exception maximisation with these measures, and how classification can be used for description minimisation. Experiments show that our approach efficiently identifies subgroups that are both exceptional and interesting. © The Author(s) 2010.
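
As a rough illustration of the Kullback-Leibler idea mentioned above, the sketch below scores a subgroup by how far the distribution of one model attribute inside the subgroup diverges from its distribution over the complete database. This is not the measure defined in the paper: the function names (distribution, kl_divergence, exceptionality), the single-attribute setting, and the Laplace smoothing are all assumptions made for the example.

```python
import math
from collections import Counter


def distribution(values, domain, alpha=1.0):
    """Laplace-smoothed empirical distribution of `values` over `domain`.

    Smoothing (an assumption of this sketch) keeps every probability
    positive, so the divergence below is always finite.
    """
    counts = Counter(values)
    total = len(values) + alpha * len(domain)
    return {v: (counts[v] + alpha) / total for v in domain}


def kl_divergence(p, q):
    """KL(p || q) in bits for two distributions over the same domain."""
    return sum(p[v] * math.log2(p[v] / q[v]) for v in p if p[v] > 0)


def exceptionality(rows, in_subgroup, target):
    """Hypothetical score: divergence of the subgroup's distribution of
    `target` from the distribution of `target` over all rows."""
    domain = sorted({row[target] for row in rows})
    subgroup_values = [row[target] for row in rows if in_subgroup(row)]
    all_values = [row[target] for row in rows]
    return kl_divergence(
        distribution(subgroup_values, domain),
        distribution(all_values, domain),
    )
```

A subgroup that covers the whole database scores zero, and the score grows as the subgroup's distribution moves away from the overall one. In an alternating scheme like the one the abstract describes, a score of this kind would be what the exception maximisation step tries to increase, while the description minimisation step looks for a short description covering the same records.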

Van Leeuwen, M. (2010). Maximal exceptions with minimal descriptions. In Data Mining and Knowledge Discovery (Vol. 21, pp. 259–276). https://doi.org/10.1007/s10618-010-0187-5
