Decision Tree Ensembles to Predict Coronavirus Disease 2019 Infection: A Comparative Study

The coronavirus disease 2019 (Covid-19) pandemic has affected most countries of the world. The detection of Covid-19 positive cases is an important step to fight the pandemic and save human lives. The polymerase chain reaction test is the most used method to detect Covid-19 positive cases. Various m...

Full description

Saved in:
Bibliographic Details
Main Authors: Amir Ahmad, Ourooj Safi, Sharaf Malebary, Sami Alesawi, Entisar Alkayal
Format: Article
Language:English
Published: Wiley 2021-01-01
Series:Complexity
Online Access:http://dx.doi.org/10.1155/2021/5550344
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1832547660207226880
author Amir Ahmad
Ourooj Safi
Sharaf Malebary
Sami Alesawi
Entisar Alkayal
author_facet Amir Ahmad
Ourooj Safi
Sharaf Malebary
Sami Alesawi
Entisar Alkayal
author_sort Amir Ahmad
collection DOAJ
description The coronavirus disease 2019 (Covid-19) pandemic has affected most countries of the world. The detection of Covid-19 positive cases is an important step to fight the pandemic and save human lives. The polymerase chain reaction test is the most used method to detect Covid-19 positive cases. Various molecular methods and serological methods have also been explored to detect Covid-19 positive cases. Machine learning algorithms have been applied to various kinds of datasets to predict Covid-19 positive cases. The machine learning algorithms were applied on a Covid-19 dataset based on commonly taken laboratory tests to predict Covid-19 positive cases. These types of datasets are easy to collect. The paper investigates the application of decision tree ensembles which are accurate and robust to the selection of parameters. As there is an imbalance between the number of positive cases and the number of negative cases, decision tree ensembles developed for imbalanced datasets are applied. F-measure, precision, recall, area under the precision-recall curve, and area under the receiver operating characteristic curve are used to compare different decision tree ensembles. Different performance measures suggest that decision tree ensembles developed for imbalanced datasets perform better. Results also suggest that including age as a variable can improve the performance of various ensembles of decision trees.
format Article
id doaj-art-a8baa35820eb484ca8440920de80cda0
institution Kabale University
issn 1076-2787
1099-0526
language English
publishDate 2021-01-01
publisher Wiley
record_format Article
series Complexity
spelling doaj-art-a8baa35820eb484ca8440920de80cda02025-02-03T06:43:55ZengWileyComplexity1076-27871099-05262021-01-01202110.1155/2021/55503445550344Decision Tree Ensembles to Predict Coronavirus Disease 2019 Infection: A Comparative StudyAmir Ahmad0Ourooj Safi1Sharaf Malebary2Sami Alesawi3Entisar Alkayal4College of Information Technology, United Arab Emirates University, Al-Ain, Abu Dhabi, UAEIndependent Researcher, Al-Ain, UAEDepartment of Information Technology, King Abdulaziz University, Rabigh 21911, Saudi ArabiaDepartment of Computer Science, King Abdulaziz University, P. O. Box 344, Rabigh 21911, Saudi ArabiaDepartment of Information Technology, King Abdulaziz University, Rabigh 21911, Saudi ArabiaThe coronavirus disease 2019 (Covid-19) pandemic has affected most countries of the world. The detection of Covid-19 positive cases is an important step to fight the pandemic and save human lives. The polymerase chain reaction test is the most used method to detect Covid-19 positive cases. Various molecular methods and serological methods have also been explored to detect Covid-19 positive cases. Machine learning algorithms have been applied to various kinds of datasets to predict Covid-19 positive cases. The machine learning algorithms were applied on a Covid-19 dataset based on commonly taken laboratory tests to predict Covid-19 positive cases. These types of datasets are easy to collect. The paper investigates the application of decision tree ensembles which are accurate and robust to the selection of parameters. As there is an imbalance between the number of positive cases and the number of negative cases, decision tree ensembles developed for imbalanced datasets are applied. F-measure, precision, recall, area under the precision-recall curve, and area under the receiver operating characteristic curve are used to compare different decision tree ensembles. Different performance measures suggest that decision tree ensembles developed for imbalanced datasets perform better. Results also suggest that including age as a variable can improve the performance of various ensembles of decision trees.http://dx.doi.org/10.1155/2021/5550344
spellingShingle Amir Ahmad
Ourooj Safi
Sharaf Malebary
Sami Alesawi
Entisar Alkayal
Decision Tree Ensembles to Predict Coronavirus Disease 2019 Infection: A Comparative Study
Complexity
title Decision Tree Ensembles to Predict Coronavirus Disease 2019 Infection: A Comparative Study
title_full Decision Tree Ensembles to Predict Coronavirus Disease 2019 Infection: A Comparative Study
title_fullStr Decision Tree Ensembles to Predict Coronavirus Disease 2019 Infection: A Comparative Study
title_full_unstemmed Decision Tree Ensembles to Predict Coronavirus Disease 2019 Infection: A Comparative Study
title_short Decision Tree Ensembles to Predict Coronavirus Disease 2019 Infection: A Comparative Study
title_sort decision tree ensembles to predict coronavirus disease 2019 infection a comparative study
url http://dx.doi.org/10.1155/2021/5550344
work_keys_str_mv AT amirahmad decisiontreeensemblestopredictcoronavirusdisease2019infectionacomparativestudy
AT ouroojsafi decisiontreeensemblestopredictcoronavirusdisease2019infectionacomparativestudy
AT sharafmalebary decisiontreeensemblestopredictcoronavirusdisease2019infectionacomparativestudy
AT samialesawi decisiontreeensemblestopredictcoronavirusdisease2019infectionacomparativestudy
AT entisaralkayal decisiontreeensemblestopredictcoronavirusdisease2019infectionacomparativestudy