Decision Tree Ensembles to Predict Coronavirus Disease 2019 Infection: A Comparative Study
The coronavirus disease 2019 (Covid-19) pandemic has affected most countries of the world. The detection of Covid-19 positive cases is an important step to fight the pandemic and save human lives. The polymerase chain reaction test is the most used method to detect Covid-19 positive cases. Various m...
Saved in:
Main Authors: | , , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
Wiley
2021-01-01
|
Series: | Complexity |
Online Access: | http://dx.doi.org/10.1155/2021/5550344 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
_version_ | 1832547660207226880 |
---|---|
author | Amir Ahmad Ourooj Safi Sharaf Malebary Sami Alesawi Entisar Alkayal |
author_facet | Amir Ahmad Ourooj Safi Sharaf Malebary Sami Alesawi Entisar Alkayal |
author_sort | Amir Ahmad |
collection | DOAJ |
description | The coronavirus disease 2019 (Covid-19) pandemic has affected most countries of the world. The detection of Covid-19 positive cases is an important step to fight the pandemic and save human lives. The polymerase chain reaction test is the most used method to detect Covid-19 positive cases. Various molecular methods and serological methods have also been explored to detect Covid-19 positive cases. Machine learning algorithms have been applied to various kinds of datasets to predict Covid-19 positive cases. The machine learning algorithms were applied on a Covid-19 dataset based on commonly taken laboratory tests to predict Covid-19 positive cases. These types of datasets are easy to collect. The paper investigates the application of decision tree ensembles which are accurate and robust to the selection of parameters. As there is an imbalance between the number of positive cases and the number of negative cases, decision tree ensembles developed for imbalanced datasets are applied. F-measure, precision, recall, area under the precision-recall curve, and area under the receiver operating characteristic curve are used to compare different decision tree ensembles. Different performance measures suggest that decision tree ensembles developed for imbalanced datasets perform better. Results also suggest that including age as a variable can improve the performance of various ensembles of decision trees. |
format | Article |
id | doaj-art-a8baa35820eb484ca8440920de80cda0 |
institution | Kabale University |
issn | 1076-2787 1099-0526 |
language | English |
publishDate | 2021-01-01 |
publisher | Wiley |
record_format | Article |
series | Complexity |
spelling | doaj-art-a8baa35820eb484ca8440920de80cda02025-02-03T06:43:55ZengWileyComplexity1076-27871099-05262021-01-01202110.1155/2021/55503445550344Decision Tree Ensembles to Predict Coronavirus Disease 2019 Infection: A Comparative StudyAmir Ahmad0Ourooj Safi1Sharaf Malebary2Sami Alesawi3Entisar Alkayal4College of Information Technology, United Arab Emirates University, Al-Ain, Abu Dhabi, UAEIndependent Researcher, Al-Ain, UAEDepartment of Information Technology, King Abdulaziz University, Rabigh 21911, Saudi ArabiaDepartment of Computer Science, King Abdulaziz University, P. O. Box 344, Rabigh 21911, Saudi ArabiaDepartment of Information Technology, King Abdulaziz University, Rabigh 21911, Saudi ArabiaThe coronavirus disease 2019 (Covid-19) pandemic has affected most countries of the world. The detection of Covid-19 positive cases is an important step to fight the pandemic and save human lives. The polymerase chain reaction test is the most used method to detect Covid-19 positive cases. Various molecular methods and serological methods have also been explored to detect Covid-19 positive cases. Machine learning algorithms have been applied to various kinds of datasets to predict Covid-19 positive cases. The machine learning algorithms were applied on a Covid-19 dataset based on commonly taken laboratory tests to predict Covid-19 positive cases. These types of datasets are easy to collect. The paper investigates the application of decision tree ensembles which are accurate and robust to the selection of parameters. As there is an imbalance between the number of positive cases and the number of negative cases, decision tree ensembles developed for imbalanced datasets are applied. F-measure, precision, recall, area under the precision-recall curve, and area under the receiver operating characteristic curve are used to compare different decision tree ensembles. Different performance measures suggest that decision tree ensembles developed for imbalanced datasets perform better. Results also suggest that including age as a variable can improve the performance of various ensembles of decision trees.http://dx.doi.org/10.1155/2021/5550344 |
spellingShingle | Amir Ahmad Ourooj Safi Sharaf Malebary Sami Alesawi Entisar Alkayal Decision Tree Ensembles to Predict Coronavirus Disease 2019 Infection: A Comparative Study Complexity |
title | Decision Tree Ensembles to Predict Coronavirus Disease 2019 Infection: A Comparative Study |
title_full | Decision Tree Ensembles to Predict Coronavirus Disease 2019 Infection: A Comparative Study |
title_fullStr | Decision Tree Ensembles to Predict Coronavirus Disease 2019 Infection: A Comparative Study |
title_full_unstemmed | Decision Tree Ensembles to Predict Coronavirus Disease 2019 Infection: A Comparative Study |
title_short | Decision Tree Ensembles to Predict Coronavirus Disease 2019 Infection: A Comparative Study |
title_sort | decision tree ensembles to predict coronavirus disease 2019 infection a comparative study |
url | http://dx.doi.org/10.1155/2021/5550344 |
work_keys_str_mv | AT amirahmad decisiontreeensemblestopredictcoronavirusdisease2019infectionacomparativestudy AT ouroojsafi decisiontreeensemblestopredictcoronavirusdisease2019infectionacomparativestudy AT sharafmalebary decisiontreeensemblestopredictcoronavirusdisease2019infectionacomparativestudy AT samialesawi decisiontreeensemblestopredictcoronavirusdisease2019infectionacomparativestudy AT entisaralkayal decisiontreeensemblestopredictcoronavirusdisease2019infectionacomparativestudy |