Mining diversified shared decision tree sets for discovering cross domain similarities

Guozhu Dong; Qian Han

Conference Proceedings

Mining diversified shared decision tree sets for discovering cross domain similarities

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2014) 8444 LNAI(PART 2) 534-547

DOI: 10.1007/978-3-319-06605-9_44

0Citations

1Readers

Get full text

Abstract

This paper studies the problem of mining diversified sets of shared decision trees (SDTs). Given two datasets representing two application domains, an SDT is a decision tree that can perform classification on both datasets and it captures class-based population-structure similarity between the two datasets. Previous studies considered mining just one SDT. The present paper considers mining a small diversified set of SDTs having two properties: (1) each SDT in the set has high quality with regard to "shared" accuracy and population-structure similarity and (2) different SDTs in the set are very different from each other. A diversified set of SDTs can serve as a concise representative of the huge space of possible cross-domain similarities, thus offering an effective way for users to examine/select informative SDTs from that huge space. The diversity of an SDT set is measured in terms of the difference of the attribute usage among the SDTs. The paper provides effective algorithms to mine diversified sets of SDTs. Experimental results show that the algorithms are effective and can find diversified sets of high quality SDTs. © 2014 Springer International Publishing.

Author supplied keywords

Cite

CITATION STYLE

APA

Dong, G., & Han, Q. (2014). Mining diversified shared decision tree sets for discovering cross domain similarities. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 8444 LNAI, pp. 534–547). Springer Verlag. https://doi.org/10.1007/978-3-319-06605-9_44

Mining diversified shared decision tree sets for discovering cross domain similarities

Abstract

Author supplied keywords

Cite

Register to see more suggestions