An Algorithm for Discretization of Real Value Attributes Based on Interval Similarity

Discretization algorithm for real value attributes is of very important uses in many areas such as intelligence and machine learning. The algorithms related to Chi2 algorithm (includes modified Chi2 algorithm and extended Chi2 algorithm) are famous discretization algorithm exploiting the technique o...

Full description

Saved in:
Bibliographic Details
Main Authors: Li Zou, Deqin Yan, Hamid Reza Karimi, Peng Shi
Format: Article
Language:English
Published: Wiley 2013-01-01
Series:Journal of Applied Mathematics
Online Access:http://dx.doi.org/10.1155/2013/350123
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1832552428864536576
author Li Zou
Deqin Yan
Hamid Reza Karimi
Peng Shi
author_facet Li Zou
Deqin Yan
Hamid Reza Karimi
Peng Shi
author_sort Li Zou
collection DOAJ
description Discretization algorithm for real value attributes is of very important uses in many areas such as intelligence and machine learning. The algorithms related to Chi2 algorithm (includes modified Chi2 algorithm and extended Chi2 algorithm) are famous discretization algorithm exploiting the technique of probability and statistics. In this paper the algorithms are analyzed, and their drawback is pointed. Based on the analysis a new modified algorithm based on interval similarity is proposed. The new algorithm defines an interval similarity function which is regarded as a new merging standard in the process of discretization. At the same time, two important parameters (condition parameter α and tiny move parameter c) in the process of discretization and discrepancy extent of a number of adjacent two intervals are given in the form of function. The related theory analysis and the experiment results show that the presented algorithm is effective.
format Article
id doaj-art-4dc70d1d7ecb4c528209af17378fc42f
institution Kabale University
issn 1110-757X
1687-0042
language English
publishDate 2013-01-01
publisher Wiley
record_format Article
series Journal of Applied Mathematics
spelling doaj-art-4dc70d1d7ecb4c528209af17378fc42f2025-02-03T05:58:40ZengWileyJournal of Applied Mathematics1110-757X1687-00422013-01-01201310.1155/2013/350123350123An Algorithm for Discretization of Real Value Attributes Based on Interval SimilarityLi Zou0Deqin Yan1Hamid Reza Karimi2Peng Shi3School of Computer and Information Technology, Liaoning Normal University, Dalian 116029, ChinaSchool of Computer and Information Technology, Liaoning Normal University, Dalian 116029, ChinaDepartment of Engineering, Faculty of Engineering and Science, University of Agder, 4898 Grimstad, NorwayCollege of Engineering and Science, Victoria University, Melbourne, VIC 8001, AustraliaDiscretization algorithm for real value attributes is of very important uses in many areas such as intelligence and machine learning. The algorithms related to Chi2 algorithm (includes modified Chi2 algorithm and extended Chi2 algorithm) are famous discretization algorithm exploiting the technique of probability and statistics. In this paper the algorithms are analyzed, and their drawback is pointed. Based on the analysis a new modified algorithm based on interval similarity is proposed. The new algorithm defines an interval similarity function which is regarded as a new merging standard in the process of discretization. At the same time, two important parameters (condition parameter α and tiny move parameter c) in the process of discretization and discrepancy extent of a number of adjacent two intervals are given in the form of function. The related theory analysis and the experiment results show that the presented algorithm is effective.http://dx.doi.org/10.1155/2013/350123
spellingShingle Li Zou
Deqin Yan
Hamid Reza Karimi
Peng Shi
An Algorithm for Discretization of Real Value Attributes Based on Interval Similarity
Journal of Applied Mathematics
title An Algorithm for Discretization of Real Value Attributes Based on Interval Similarity
title_full An Algorithm for Discretization of Real Value Attributes Based on Interval Similarity
title_fullStr An Algorithm for Discretization of Real Value Attributes Based on Interval Similarity
title_full_unstemmed An Algorithm for Discretization of Real Value Attributes Based on Interval Similarity
title_short An Algorithm for Discretization of Real Value Attributes Based on Interval Similarity
title_sort algorithm for discretization of real value attributes based on interval similarity
url http://dx.doi.org/10.1155/2013/350123
work_keys_str_mv AT lizou analgorithmfordiscretizationofrealvalueattributesbasedonintervalsimilarity
AT deqinyan analgorithmfordiscretizationofrealvalueattributesbasedonintervalsimilarity
AT hamidrezakarimi analgorithmfordiscretizationofrealvalueattributesbasedonintervalsimilarity
AT pengshi analgorithmfordiscretizationofrealvalueattributesbasedonintervalsimilarity
AT lizou algorithmfordiscretizationofrealvalueattributesbasedonintervalsimilarity
AT deqinyan algorithmfordiscretizationofrealvalueattributesbasedonintervalsimilarity
AT hamidrezakarimi algorithmfordiscretizationofrealvalueattributesbasedonintervalsimilarity
AT pengshi algorithmfordiscretizationofrealvalueattributesbasedonintervalsimilarity