CUDT: A CUDA Based Decision Tree Algorithm

Decision tree is one of the famous classification methods in data mining. Many researches have been proposed, which were focusing on improving the performance of decision tree. However, those algorithms are developed and run on traditional distributed systems. Obviously the latency could not be impr...

Full description

Saved in:
Bibliographic Details
Main Authors: Win-Tsung Lo, Yue-Shan Chang, Ruey-Kai Sheu, Chun-Chieh Chiu, Shyan-Ming Yuan
Format: Article
Language:English
Published: Wiley 2014-01-01
Series:The Scientific World Journal
Online Access:http://dx.doi.org/10.1155/2014/745640
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1832559342943993856
author Win-Tsung Lo
Yue-Shan Chang
Ruey-Kai Sheu
Chun-Chieh Chiu
Shyan-Ming Yuan
author_facet Win-Tsung Lo
Yue-Shan Chang
Ruey-Kai Sheu
Chun-Chieh Chiu
Shyan-Ming Yuan
author_sort Win-Tsung Lo
collection DOAJ
description Decision tree is one of the famous classification methods in data mining. Many researches have been proposed, which were focusing on improving the performance of decision tree. However, those algorithms are developed and run on traditional distributed systems. Obviously the latency could not be improved while processing huge data generated by ubiquitous sensing node in the era without new technology help. In order to improve data processing latency in huge data mining, in this paper, we design and implement a new parallelized decision tree algorithm on a CUDA (compute unified device architecture), which is a GPGPU solution provided by NVIDIA. In the proposed system, CPU is responsible for flow control while the GPU is responsible for computation. We have conducted many experiments to evaluate system performance of CUDT and made a comparison with traditional CPU version. The results show that CUDT is 5∼55 times faster than Weka-j48 and is 18 times speedup than SPRINT for large data set.
format Article
id doaj-art-4a24fb2077784bda82516faaf07b4d5f
institution Kabale University
issn 2356-6140
1537-744X
language English
publishDate 2014-01-01
publisher Wiley
record_format Article
series The Scientific World Journal
spelling doaj-art-4a24fb2077784bda82516faaf07b4d5f2025-02-03T01:30:20ZengWileyThe Scientific World Journal2356-61401537-744X2014-01-01201410.1155/2014/745640745640CUDT: A CUDA Based Decision Tree AlgorithmWin-Tsung Lo0Yue-Shan Chang1Ruey-Kai Sheu2Chun-Chieh Chiu3Shyan-Ming Yuan4Department of Computer Science, Tung Hai University, Taichung 40704, TaiwanDepartment of Computer Science and Information Engineering, National Taipei University, New Taipei 23741, TaiwanDepartment of Computer Science, Tung Hai University, Taichung 40704, TaiwanDepartment of Computer Science, National Chiao Tung University, Hsinchu 30010, TaiwanDepartment of Computer Science, National Chiao Tung University, Hsinchu 30010, TaiwanDecision tree is one of the famous classification methods in data mining. Many researches have been proposed, which were focusing on improving the performance of decision tree. However, those algorithms are developed and run on traditional distributed systems. Obviously the latency could not be improved while processing huge data generated by ubiquitous sensing node in the era without new technology help. In order to improve data processing latency in huge data mining, in this paper, we design and implement a new parallelized decision tree algorithm on a CUDA (compute unified device architecture), which is a GPGPU solution provided by NVIDIA. In the proposed system, CPU is responsible for flow control while the GPU is responsible for computation. We have conducted many experiments to evaluate system performance of CUDT and made a comparison with traditional CPU version. The results show that CUDT is 5∼55 times faster than Weka-j48 and is 18 times speedup than SPRINT for large data set.http://dx.doi.org/10.1155/2014/745640
spellingShingle Win-Tsung Lo
Yue-Shan Chang
Ruey-Kai Sheu
Chun-Chieh Chiu
Shyan-Ming Yuan
CUDT: A CUDA Based Decision Tree Algorithm
The Scientific World Journal
title CUDT: A CUDA Based Decision Tree Algorithm
title_full CUDT: A CUDA Based Decision Tree Algorithm
title_fullStr CUDT: A CUDA Based Decision Tree Algorithm
title_full_unstemmed CUDT: A CUDA Based Decision Tree Algorithm
title_short CUDT: A CUDA Based Decision Tree Algorithm
title_sort cudt a cuda based decision tree algorithm
url http://dx.doi.org/10.1155/2014/745640
work_keys_str_mv AT wintsunglo cudtacudabaseddecisiontreealgorithm
AT yueshanchang cudtacudabaseddecisiontreealgorithm
AT rueykaisheu cudtacudabaseddecisiontreealgorithm
AT chunchiehchiu cudtacudabaseddecisiontreealgorithm
AT shyanmingyuan cudtacudabaseddecisiontreealgorithm