Learning Mixtures of Weighted Tree-Unions by Minimizing Description Length

Andrea Torsello; Edwin R. Hancock

Journal ArticleOPEN ACCESS

Learning Mixtures of Weighted Tree-Unions by Minimizing Description Length

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2004) 3023 13-25

DOI: 10.1007/978-3-540-24672-5_2

0Citations

4Readers

Abstract

This paper focuses on how to perform the unsupervised clustering of tree structures in an information theoretic setting. We pose the problem of clustering as that of locating a series of archetypes that can be used to represent the variations in tree structure present in the training sample. The archetypes are tree-unions that are formed by merging sets of sample trees, and are attributed with probabilities that measure the node frequency or weight in the training sample. The approach is designed to operate when the correspondences between nodes are unknown and must be inferred as part of the learning process. We show how the tree merging process can be posed as the minimisation of an information theoretic minimum descriptor length criterion. We illustrate the utility of the resulting algorithm on the problem of classifying 2D shapes using a shock graph representation. © Springer-Verlag 2004.

Cite

CITATION STYLE

APA

Torsello, A., & Hancock, E. R. (2004). Learning Mixtures of Weighted Tree-Unions by Minimizing Description Length. Lecture Notes in Computer Science (Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 3023, 13–25. https://doi.org/10.1007/978-3-540-24672-5_2

Learning Mixtures of Weighted Tree-Unions by Minimizing Description Length

Abstract

Cite

Register to see more suggestions