A New Robust Classifier on Noise Domains: Bagging of Credal C4.5 Trees

Joaquín Abellán; Javier G. Castellano; Carlos J. Mantas

Journal ArticleOPEN ACCESS

A New Robust Classifier on Noise Domains: Bagging of Credal C4.5 Trees

Complexity (2017) 2017

DOI: 10.1155/2017/9023970

6Citations

16Readers

Abstract

The knowledge extraction from data with noise or outliers is a complex problem in the data mining area. Normally, it is not easy to eliminate those problematic instances. To obtain information from this type of data, robust classifiers are the best option to use. One of them is the application of bagging scheme on weak single classifiers. The Credal C4.5 (CC4.5) model is a new classification tree procedure based on the classical C4.5 algorithm and imprecise probabilities. It represents a type of the so-called credal trees. It has been proven that CC4.5 is more robust to noise than C4.5 method and even than other previous credal tree models. In this paper, the performance of the CC4.5 model in bagging schemes on noisy domains is shown. An experimental study on data sets with added noise is carried out in order to compare results where bagging schemes are applied on credal trees and C4.5 procedure. As a benchmark point, the known Random Forest (RF) classification method is also used. It will be shown that the bagging ensemble using pruned credal trees outperforms the successful bagging C4.5 and RF when data sets with medium-to-high noise level are classified.

References Powered by Scopus

View more at Scopus

Cited by Powered by Scopus

View more at Scopus

Cite

CITATION STYLE

APA

Abellán, J., Castellano, J. G., & Mantas, C. J. (2017). A New Robust Classifier on Noise Domains: Bagging of Credal C4.5 Trees. Complexity, 2017. https://doi.org/10.1155/2017/9023970

Readers' Seniority

PhD / Post grad / Masters / Doc 5

71%

Lecturer / Post doc 2

29%

Readers' Discipline

Computer Science 7

100%

Article Metrics

Social Media

Shares, Likes & Comments: 33

View details >

A New Robust Classifier on Noise Domains: Bagging of Credal C4.5 Trees

Abstract

References Powered by Scopus

Random forests

A Mathematical Theory of Communication

Bagging predictors

Cited by Powered by Scopus

Construction site accident analysis using text mining and natural language processing techniques

A review of addressing class noise problems of remote sensing classification

Identification of Proteins of Tobacco Mosaic Virus by Using a Method of Feature Extraction

Register to see more suggestions

Cite

Readers' Seniority

Readers' Discipline

Article Metrics