Ensemble Classification Model With CFS-IGWO–Based Feature Selection for Cancer Detection Using Microarray Data
Cancer is the top cause of death worldwide, and machine learning (ML) has made an indelible mark on the field of early cancer detection, thereby lowering the death toll. ML-based model for cancer diagnosis is done using two forms of data: gene expression data and microarray data. The data on gene ex...
Saved in:
Main Authors: | , , , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
Wiley
2024-01-01
|
Series: | International Journal of Telemedicine and Applications |
Online Access: | http://dx.doi.org/10.1155/2024/4105224 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
_version_ | 1832557192939569152 |
---|---|
author | Pinakshi Panda Sukant Kishoro Bisoy Sandeep Kautish Reyaz Ahmad Asma Irshad Nadeem Sarwar |
author_facet | Pinakshi Panda Sukant Kishoro Bisoy Sandeep Kautish Reyaz Ahmad Asma Irshad Nadeem Sarwar |
author_sort | Pinakshi Panda |
collection | DOAJ |
description | Cancer is the top cause of death worldwide, and machine learning (ML) has made an indelible mark on the field of early cancer detection, thereby lowering the death toll. ML-based model for cancer diagnosis is done using two forms of data: gene expression data and microarray data. The data on gene expression levels includes many dimensions. When dealing with data with a high dimension, the efficiency of an ML-based model is decreased. Microarray data is distinguished by its high dimensionality with a greater number of features and a smaller sample size. In this work, two ensemble techniques are proposed using majority voting technique and weighted average technique. Correlation feature selection (CFS) is used for feature selection, and improved grey wolf optimizer (IGWO) is used for feature optimization. Support vector machines (SVMs), multilayer perceptron (MLP) classification, logistic regression (LR), decision tree (DT), adaptive boosting (AdaBoost) classifier, extreme learning machines (ELMs), and K-nearest neighbor (KNN) are used as classifiers. The results of each distinct base learner were then combined using weighted average and majority voting ensemble methods. Accuracy (ACC), specificity (SPE), sensitivity (SEN), precision (PRE), Matthews correlation coefficient (MCC), and F1-score (F1-S) are used to assess the performance. Our result shows that majority voting achieves better performance than the weighted average ensemble technique. |
format | Article |
id | doaj-art-bcd973a9222c4059a78df51ba7a04231 |
institution | Kabale University |
issn | 1687-6423 |
language | English |
publishDate | 2024-01-01 |
publisher | Wiley |
record_format | Article |
series | International Journal of Telemedicine and Applications |
spelling | doaj-art-bcd973a9222c4059a78df51ba7a042312025-02-03T05:42:21ZengWileyInternational Journal of Telemedicine and Applications1687-64232024-01-01202410.1155/2024/4105224Ensemble Classification Model With CFS-IGWO–Based Feature Selection for Cancer Detection Using Microarray DataPinakshi Panda0Sukant Kishoro Bisoy1Sandeep Kautish2Reyaz Ahmad3Asma Irshad4Nadeem Sarwar5Department of Computer Science & EngineeringDepartment of Computer Science & EngineeringApex Institute of TechnologySchool of General EducationSchool of Biochemistry and BiotechnologyDepartment of Computer ScienceCancer is the top cause of death worldwide, and machine learning (ML) has made an indelible mark on the field of early cancer detection, thereby lowering the death toll. ML-based model for cancer diagnosis is done using two forms of data: gene expression data and microarray data. The data on gene expression levels includes many dimensions. When dealing with data with a high dimension, the efficiency of an ML-based model is decreased. Microarray data is distinguished by its high dimensionality with a greater number of features and a smaller sample size. In this work, two ensemble techniques are proposed using majority voting technique and weighted average technique. Correlation feature selection (CFS) is used for feature selection, and improved grey wolf optimizer (IGWO) is used for feature optimization. Support vector machines (SVMs), multilayer perceptron (MLP) classification, logistic regression (LR), decision tree (DT), adaptive boosting (AdaBoost) classifier, extreme learning machines (ELMs), and K-nearest neighbor (KNN) are used as classifiers. The results of each distinct base learner were then combined using weighted average and majority voting ensemble methods. Accuracy (ACC), specificity (SPE), sensitivity (SEN), precision (PRE), Matthews correlation coefficient (MCC), and F1-score (F1-S) are used to assess the performance. Our result shows that majority voting achieves better performance than the weighted average ensemble technique.http://dx.doi.org/10.1155/2024/4105224 |
spellingShingle | Pinakshi Panda Sukant Kishoro Bisoy Sandeep Kautish Reyaz Ahmad Asma Irshad Nadeem Sarwar Ensemble Classification Model With CFS-IGWO–Based Feature Selection for Cancer Detection Using Microarray Data International Journal of Telemedicine and Applications |
title | Ensemble Classification Model With CFS-IGWO–Based Feature Selection for Cancer Detection Using Microarray Data |
title_full | Ensemble Classification Model With CFS-IGWO–Based Feature Selection for Cancer Detection Using Microarray Data |
title_fullStr | Ensemble Classification Model With CFS-IGWO–Based Feature Selection for Cancer Detection Using Microarray Data |
title_full_unstemmed | Ensemble Classification Model With CFS-IGWO–Based Feature Selection for Cancer Detection Using Microarray Data |
title_short | Ensemble Classification Model With CFS-IGWO–Based Feature Selection for Cancer Detection Using Microarray Data |
title_sort | ensemble classification model with cfs igwo based feature selection for cancer detection using microarray data |
url | http://dx.doi.org/10.1155/2024/4105224 |
work_keys_str_mv | AT pinakshipanda ensembleclassificationmodelwithcfsigwobasedfeatureselectionforcancerdetectionusingmicroarraydata AT sukantkishorobisoy ensembleclassificationmodelwithcfsigwobasedfeatureselectionforcancerdetectionusingmicroarraydata AT sandeepkautish ensembleclassificationmodelwithcfsigwobasedfeatureselectionforcancerdetectionusingmicroarraydata AT reyazahmad ensembleclassificationmodelwithcfsigwobasedfeatureselectionforcancerdetectionusingmicroarraydata AT asmairshad ensembleclassificationmodelwithcfsigwobasedfeatureselectionforcancerdetectionusingmicroarraydata AT nadeemsarwar ensembleclassificationmodelwithcfsigwobasedfeatureselectionforcancerdetectionusingmicroarraydata |