A New Robust Classifier on Noise Domains: Bagging of Credal C4.5 Trees

The knowledge extraction from data with noise or outliers is a complex problem in the data mining area. Normally, it is not easy to eliminate those problematic instances. To obtain information from this type of data, robust classifiers are the best option to use. One of them is the application of ba...

Full description

Saved in:

Bibliographic Details
Main Authors:	Joaquín Abellán, Javier G. Castellano, Carlos J. Mantas
Format:	Article
Language:	English
Published:	Wiley 2017-01-01
Series:	Complexity
Online Access:	http://dx.doi.org/10.1155/2017/9023970
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1832548535233413120
author	Joaquín Abellán Javier G. Castellano Carlos J. Mantas
author_facet	Joaquín Abellán Javier G. Castellano Carlos J. Mantas
author_sort	Joaquín Abellán
collection	DOAJ
description	The knowledge extraction from data with noise or outliers is a complex problem in the data mining area. Normally, it is not easy to eliminate those problematic instances. To obtain information from this type of data, robust classifiers are the best option to use. One of them is the application of bagging scheme on weak single classifiers. The Credal C4.5 (CC4.5) model is a new classification tree procedure based on the classical C4.5 algorithm and imprecise probabilities. It represents a type of the so-called credal trees. It has been proven that CC4.5 is more robust to noise than C4.5 method and even than other previous credal tree models. In this paper, the performance of the CC4.5 model in bagging schemes on noisy domains is shown. An experimental study on data sets with added noise is carried out in order to compare results where bagging schemes are applied on credal trees and C4.5 procedure. As a benchmark point, the known Random Forest (RF) classification method is also used. It will be shown that the bagging ensemble using pruned credal trees outperforms the successful bagging C4.5 and RF when data sets with medium-to-high noise level are classified.
format	Article
id	doaj-art-4a0d320edbcf40b5a79f991916783e92
institution	Kabale University
issn	1076-2787 1099-0526
language	English
publishDate	2017-01-01
publisher	Wiley
record_format	Article
series	Complexity
spelling	doaj-art-4a0d320edbcf40b5a79f991916783e922025-02-03T06:13:49ZengWileyComplexity1076-27871099-05262017-01-01201710.1155/2017/90239709023970A New Robust Classifier on Noise Domains: Bagging of Credal C4.5 TreesJoaquín Abellán0Javier G. Castellano1Carlos J. Mantas2Department of Computer Science and Artificial Intelligence, University of Granada, Granada, SpainDepartment of Computer Science and Artificial Intelligence, University of Granada, Granada, SpainDepartment of Computer Science and Artificial Intelligence, University of Granada, Granada, SpainThe knowledge extraction from data with noise or outliers is a complex problem in the data mining area. Normally, it is not easy to eliminate those problematic instances. To obtain information from this type of data, robust classifiers are the best option to use. One of them is the application of bagging scheme on weak single classifiers. The Credal C4.5 (CC4.5) model is a new classification tree procedure based on the classical C4.5 algorithm and imprecise probabilities. It represents a type of the so-called credal trees. It has been proven that CC4.5 is more robust to noise than C4.5 method and even than other previous credal tree models. In this paper, the performance of the CC4.5 model in bagging schemes on noisy domains is shown. An experimental study on data sets with added noise is carried out in order to compare results where bagging schemes are applied on credal trees and C4.5 procedure. As a benchmark point, the known Random Forest (RF) classification method is also used. It will be shown that the bagging ensemble using pruned credal trees outperforms the successful bagging C4.5 and RF when data sets with medium-to-high noise level are classified.http://dx.doi.org/10.1155/2017/9023970
spellingShingle	Joaquín Abellán Javier G. Castellano Carlos J. Mantas A New Robust Classifier on Noise Domains: Bagging of Credal C4.5 Trees Complexity
title	A New Robust Classifier on Noise Domains: Bagging of Credal C4.5 Trees
title_full	A New Robust Classifier on Noise Domains: Bagging of Credal C4.5 Trees
title_fullStr	A New Robust Classifier on Noise Domains: Bagging of Credal C4.5 Trees
title_full_unstemmed	A New Robust Classifier on Noise Domains: Bagging of Credal C4.5 Trees
title_short	A New Robust Classifier on Noise Domains: Bagging of Credal C4.5 Trees
title_sort	new robust classifier on noise domains bagging of credal c4 5 trees
url	http://dx.doi.org/10.1155/2017/9023970
work_keys_str_mv	AT joaquinabellan anewrobustclassifieronnoisedomainsbaggingofcredalc45trees AT javiergcastellano anewrobustclassifieronnoisedomainsbaggingofcredalc45trees AT carlosjmantas anewrobustclassifieronnoisedomainsbaggingofcredalc45trees AT joaquinabellan newrobustclassifieronnoisedomainsbaggingofcredalc45trees AT javiergcastellano newrobustclassifieronnoisedomainsbaggingofcredalc45trees AT carlosjmantas newrobustclassifieronnoisedomainsbaggingofcredalc45trees

A New Robust Classifier on Noise Domains: Bagging of Credal C4.5 Trees

Similar Items