Ensemble Classification Model With CFS-IGWO–Based Feature Selection for Cancer Detection Using Microarray Data

Cancer is the top cause of death worldwide, and machine learning (ML) has made an indelible mark on the field of early cancer detection, thereby lowering the death toll. ML-based model for cancer diagnosis is done using two forms of data: gene expression data and microarray data. The data on gene ex...

Full description

Saved in:
Bibliographic Details
Main Authors: Pinakshi Panda, Sukant Kishoro Bisoy, Sandeep Kautish, Reyaz Ahmad, Asma Irshad, Nadeem Sarwar
Format: Article
Language:English
Published: Wiley 2024-01-01
Series:International Journal of Telemedicine and Applications
Online Access:http://dx.doi.org/10.1155/2024/4105224
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1832557192939569152
author Pinakshi Panda
Sukant Kishoro Bisoy
Sandeep Kautish
Reyaz Ahmad
Asma Irshad
Nadeem Sarwar
author_facet Pinakshi Panda
Sukant Kishoro Bisoy
Sandeep Kautish
Reyaz Ahmad
Asma Irshad
Nadeem Sarwar
author_sort Pinakshi Panda
collection DOAJ
description Cancer is the top cause of death worldwide, and machine learning (ML) has made an indelible mark on the field of early cancer detection, thereby lowering the death toll. ML-based model for cancer diagnosis is done using two forms of data: gene expression data and microarray data. The data on gene expression levels includes many dimensions. When dealing with data with a high dimension, the efficiency of an ML-based model is decreased. Microarray data is distinguished by its high dimensionality with a greater number of features and a smaller sample size. In this work, two ensemble techniques are proposed using majority voting technique and weighted average technique. Correlation feature selection (CFS) is used for feature selection, and improved grey wolf optimizer (IGWO) is used for feature optimization. Support vector machines (SVMs), multilayer perceptron (MLP) classification, logistic regression (LR), decision tree (DT), adaptive boosting (AdaBoost) classifier, extreme learning machines (ELMs), and K-nearest neighbor (KNN) are used as classifiers. The results of each distinct base learner were then combined using weighted average and majority voting ensemble methods. Accuracy (ACC), specificity (SPE), sensitivity (SEN), precision (PRE), Matthews correlation coefficient (MCC), and F1-score (F1-S) are used to assess the performance. Our result shows that majority voting achieves better performance than the weighted average ensemble technique.
format Article
id doaj-art-bcd973a9222c4059a78df51ba7a04231
institution Kabale University
issn 1687-6423
language English
publishDate 2024-01-01
publisher Wiley
record_format Article
series International Journal of Telemedicine and Applications
spelling doaj-art-bcd973a9222c4059a78df51ba7a042312025-02-03T05:42:21ZengWileyInternational Journal of Telemedicine and Applications1687-64232024-01-01202410.1155/2024/4105224Ensemble Classification Model With CFS-IGWO–Based Feature Selection for Cancer Detection Using Microarray DataPinakshi Panda0Sukant Kishoro Bisoy1Sandeep Kautish2Reyaz Ahmad3Asma Irshad4Nadeem Sarwar5Department of Computer Science & EngineeringDepartment of Computer Science & EngineeringApex Institute of TechnologySchool of General EducationSchool of Biochemistry and BiotechnologyDepartment of Computer ScienceCancer is the top cause of death worldwide, and machine learning (ML) has made an indelible mark on the field of early cancer detection, thereby lowering the death toll. ML-based model for cancer diagnosis is done using two forms of data: gene expression data and microarray data. The data on gene expression levels includes many dimensions. When dealing with data with a high dimension, the efficiency of an ML-based model is decreased. Microarray data is distinguished by its high dimensionality with a greater number of features and a smaller sample size. In this work, two ensemble techniques are proposed using majority voting technique and weighted average technique. Correlation feature selection (CFS) is used for feature selection, and improved grey wolf optimizer (IGWO) is used for feature optimization. Support vector machines (SVMs), multilayer perceptron (MLP) classification, logistic regression (LR), decision tree (DT), adaptive boosting (AdaBoost) classifier, extreme learning machines (ELMs), and K-nearest neighbor (KNN) are used as classifiers. The results of each distinct base learner were then combined using weighted average and majority voting ensemble methods. Accuracy (ACC), specificity (SPE), sensitivity (SEN), precision (PRE), Matthews correlation coefficient (MCC), and F1-score (F1-S) are used to assess the performance. Our result shows that majority voting achieves better performance than the weighted average ensemble technique.http://dx.doi.org/10.1155/2024/4105224
spellingShingle Pinakshi Panda
Sukant Kishoro Bisoy
Sandeep Kautish
Reyaz Ahmad
Asma Irshad
Nadeem Sarwar
Ensemble Classification Model With CFS-IGWO–Based Feature Selection for Cancer Detection Using Microarray Data
International Journal of Telemedicine and Applications
title Ensemble Classification Model With CFS-IGWO–Based Feature Selection for Cancer Detection Using Microarray Data
title_full Ensemble Classification Model With CFS-IGWO–Based Feature Selection for Cancer Detection Using Microarray Data
title_fullStr Ensemble Classification Model With CFS-IGWO–Based Feature Selection for Cancer Detection Using Microarray Data
title_full_unstemmed Ensemble Classification Model With CFS-IGWO–Based Feature Selection for Cancer Detection Using Microarray Data
title_short Ensemble Classification Model With CFS-IGWO–Based Feature Selection for Cancer Detection Using Microarray Data
title_sort ensemble classification model with cfs igwo based feature selection for cancer detection using microarray data
url http://dx.doi.org/10.1155/2024/4105224
work_keys_str_mv AT pinakshipanda ensembleclassificationmodelwithcfsigwobasedfeatureselectionforcancerdetectionusingmicroarraydata
AT sukantkishorobisoy ensembleclassificationmodelwithcfsigwobasedfeatureselectionforcancerdetectionusingmicroarraydata
AT sandeepkautish ensembleclassificationmodelwithcfsigwobasedfeatureselectionforcancerdetectionusingmicroarraydata
AT reyazahmad ensembleclassificationmodelwithcfsigwobasedfeatureselectionforcancerdetectionusingmicroarraydata
AT asmairshad ensembleclassificationmodelwithcfsigwobasedfeatureselectionforcancerdetectionusingmicroarraydata
AT nadeemsarwar ensembleclassificationmodelwithcfsigwobasedfeatureselectionforcancerdetectionusingmicroarraydata