Beam search induction and similarity constraints for predictive clustering trees

11Citations
Citations of this article
9Readers
Mendeley users who have this article in their library.
Get full text

Abstract

Much research on inductive databases (IDBs) focuses on local models, such as item sets and association rules. In this work, we investigate how IDBs can support global models, such as decision trees. Our focus is on predictive clustering trees (PCTs). PCTs generalize decision trees and can be used for prediction and clustering, two of the most common data mining tasks. Regular PCT induction builds PCTs top-down, using a greedy algorithm, similar to that of C4.5. We propose a new induction algorithm for PCTs based on beam search. This has three advantages over the regular method: (a) it returns a set of PCTs satisfying the user constraints instead of just one PCT; (b) it better allows for pushing of user constraints into the induction algorithm; and (c) it is less susceptible to myopia. In addition, we propose similarity constraints for PCTs, which improve the diversity of the resulting PCT set. © Springer-Verlag Berlin Heidelberg 2007.

Cite

CITATION STYLE

APA

Kocev, D., Struyf, J., & Džeroski, S. (2007). Beam search induction and similarity constraints for predictive clustering trees. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 4747 LNCS, pp. 134–151). Springer Verlag. https://doi.org/10.1007/978-3-540-75549-4_9

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free