Mining patterns is a core task in data analysis and, beyond issues of efficient enumeration, the selection of patterns constitutes a major challenge. The Minimum Description Length (MDL) principle, a model selection method grounded in information theory, has been applied to pattern mining with the aim to obtain compact high-quality sets of patterns. After giving an outline of relevant concepts from information theory and coding, we review MDL-based methods for mining different kinds of patterns from various types of data. Finally, we open a discussion on some issues regarding these methods.
CITATION STYLE
Galbrun, E. (2022). The minimum description length principle for pattern mining: a survey. Data Mining and Knowledge Discovery, 36(5), 1679–1727. https://doi.org/10.1007/s10618-022-00846-z
Mendeley helps you to discover research relevant for your work.