Knowledge-based sampling for subgroup discovery

Martin Scholz

Conference Proceedings

Knowledge-based sampling for subgroup discovery

Scholz M

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2005) 3539 LNAI 171-189

DOI: 10.1007/11504245_11

11Citations

13Readers

Get full text

Abstract

Subgroup discovery aims at finding interesting subsets of a classified example set that deviates from the overall distribution. The search is guided by a so-called utility function, trading the size of subsets (coverage) against their statistical unusualness. By choosing the utility function accordingly, subgroup discovery is well suited to find interesting rules with much smaller coverage and bias than possible with standard classifier induction algorithms. Smaller subsets can be considered local patterns, but this work uses yet another definition: According to this definition global patterns consist of all patterns reflecting the prior knowledge available to a learner, including all previously found patterns. All further unexpected regularities in the data are referred to as local patterns. To address local pattern mining in this scenario, an extension of subgroup discovery by the knowledge-based sampling approach to iterative model refinement is presented. It is a general, cheap way of incorporating prior probabilistic knowledge in arbitrary form into Data Mining algorithms addressing supervised learning tasks. © Springer-Verlag Berlin Heidelberg 2005.

Cite

CITATION STYLE

APA

Scholz, M. (2005). Knowledge-based sampling for subgroup discovery. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 3539 LNAI, pp. 171–189). Springer Verlag. https://doi.org/10.1007/11504245_11

Knowledge-based sampling for subgroup discovery

Abstract

Cite

Register to see more suggestions