A hybrid decision tree training method using data streams

Michal Wozniak

Journal Article, Open Access

Knowledge and Information Systems (2011) 29(2) 335-347

DOI: 10.1007/s10115-010-0345-5

Citations: 46

Readers: 31

Abstract

Classical classification methods usually assume that pattern recognition models do not depend on the timing of the data. However, this assumption is not valid in cases where new data frequently become available. Such situations are common in practice, for example in spam filtering or fraud detection, where the dependencies between feature values and class labels are continually changing. Unfortunately, most classical machine learning methods (such as decision trees) do not take into account the possibility that the model changes as a result of so-called concept drift, and they cannot adapt to a new classification model. This paper focuses on the problem of concept drift, which is a very important issue, especially in data mining methods that use complex structures (such as decision trees) for making decisions. We propose an algorithm that is able to co-train decision trees using a modified NGE (Nested Generalized Exemplar) algorithm. The adaptation potential of the proposed algorithm and its quality are evaluated through computer experiments carried out on benchmark datasets from the UCI Machine Learning Repository. © 2010 The Author(s).
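The effect of concept drift described in the abstract can be illustrated with a small sketch. This is not the paper's NGE-based co-training algorithm; it is a minimal illustration under stated assumptions: a one-feature decision stump stands in for a decision tree, the stream is synthetic, and adaptation is done by simply refitting on a sliding window. All names (`fit_stump`, `make_stream`, `run`) and all parameter values are hypothetical.

```python
import random
from collections import deque


def fit_stump(samples):
    """Pick the threshold t (on a coarse grid) that minimises the
    errors of the rule 'predict class 1 if x > t'."""
    best_t, best_err = 0.0, len(samples) + 1
    for t in [i / 20 for i in range(21)]:
        err = sum((x > t) != bool(y) for x, y in samples)
        if err < best_err:
            best_t, best_err = t, err
    return best_t


def make_stream(n, drift_at, seed=0):
    """Synthetic stream (an assumption, not data from the paper):
    the true class boundary jumps from 0.3 to 0.7 at index drift_at."""
    rng = random.Random(seed)
    for i in range(n):
        x = rng.random()
        boundary = 0.3 if i < drift_at else 0.7
        yield x, int(x > boundary)


def run(n=2000, drift_at=1000, window=100):
    """Compare a stump trained once with one refitted on a sliding
    window, measuring accuracy only on samples after the drift."""
    stream = list(make_stream(n, drift_at))
    static_t = fit_stump(stream[:window])       # trained once, never updated
    recent = deque(stream[:window], maxlen=window)
    adaptive_t = static_t
    static_hits = adaptive_hits = post = 0
    for i, (x, y) in enumerate(stream[window:], start=window):
        if i >= drift_at:
            # Test first, then train on the sample (prequential evaluation).
            post += 1
            static_hits += int((x > static_t) == bool(y))
            adaptive_hits += int((x > adaptive_t) == bool(y))
        recent.append((x, y))
        adaptive_t = fit_stump(recent)          # refit on the latest window
    return static_hits / post, adaptive_hits / post


if __name__ == "__main__":
    static_acc, adaptive_acc = run()
    print(f"static model accuracy after drift:   {static_acc:.3f}")
    print(f"adaptive model accuracy after drift: {adaptive_acc:.3f}")
```

In this toy setting the model trained once keeps using the old boundary and misclassifies the region between the old and new boundaries, while the windowed model recovers once its window contains only post-drift samples. Refitting from scratch at every step is the simplest possible adaptation strategy and is chosen here only for clarity.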

Cite

Wozniak, M. (2011). A hybrid decision tree training method using data streams. Knowledge and Information Systems, 29(2), 335–347. https://doi.org/10.1007/s10115-010-0345-5