This paper study splitting criterion in decision trees using three original points of view. First we propose a unified formalization for association measures based on entropy of typo beta. This formalization includes popular measures such as Gini index or Shannon entropy. Second, we generate artificial data from M-of-N concepts whoso complexity and class distribution are controlled. Third, our experiment allows us to study the behavior of measures on datasets of growing complexity. The results show' that the differences of performances between measures, which are significant when there is no noise in the data, disappear when the level of noise increases.
CITATION STYLE
Rakotomalala, R., Lallich, S., & Di Palma, S. (1999). Studying the behavior of generalized entropy in induction trees using a M-of-N concept. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 1704, pp. 510–517). Springer Verlag. https://doi.org/10.1007/978-3-540-48247-5_66
Mendeley helps you to discover research relevant for your work.