Binary plant rhizome growth-based optimization algorithm: an efficient high-dimensional feature selection approach

Abstract Feature selection is a pivotal research area within machine learning, tasked with pinpointing the essential subset of features from a broad array that critically influences a model’s predictive capabilities. This process enhances model precision and drastically lowers the computational dema...

Full description

Saved in:
Bibliographic Details
Main Authors: Jin Zhang, Fu Yan, Jianqiang Yang
Format: Article
Language:English
Published: SpringerOpen 2025-01-01
Series:Journal of Big Data
Subjects:
Online Access:https://doi.org/10.1186/s40537-025-01066-0
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1832585618930008064
author Jin Zhang
Fu Yan
Jianqiang Yang
author_facet Jin Zhang
Fu Yan
Jianqiang Yang
author_sort Jin Zhang
collection DOAJ
description Abstract Feature selection is a pivotal research area within machine learning, tasked with pinpointing the essential subset of features from a broad array that critically influences a model’s predictive capabilities. This process enhances model precision and drastically lowers the computational demands associated with training and predicting. Consequently, more advanced optimization techniques are employed to address the challenge of feature selection. This paper introduces an innovative intelligent optimization algorithm, the Plant Root Growth Optimization (PRGO) algorithm, inspired by the structure of plant rhizomes and the way they absorb nutrients.In the algorithm, the plant rhizomes are divided into two categories, the taproot and the fibrous root.the growth process of the taproot plants is associated with the global exploration search, and the growth process of the fibrous root plants relates to the local exploitation search.The global asymptotic convergence of the algorithm is proved by applying Markov’s correlation theory, and simulation results using CEC2014 and CEC2017 test sets show that the proposed algorithm has excellent performance.Moreover, a binary variant of this algorithm (BPRGO) has been specifically crafted in this research to tackle the complexities of high-dimensional feature selection issues. The algorithm was compared to eight well-known feature selection methods and its performance was evaluated using a variety of evaluation metrics on 16 high-dimensional datasets from the Arizona State University feature selection library. and the performance of the proposed algorithm was evaluated through feature subset size, classification accuracy, fitness value, and F1-score. The experimental results show that BPRGO achieves the best performance, which has stronger feature reduction ability and achieves better overall performance on most datasets. BPRGO can obtain extremely smaller feature subsets while maintaining much higher classification accuracy, and satisfactory F1-score.
format Article
id doaj-art-202dbb7c15054df78b1c3d2062339aed
institution Kabale University
issn 2196-1115
language English
publishDate 2025-01-01
publisher SpringerOpen
record_format Article
series Journal of Big Data
spelling doaj-art-202dbb7c15054df78b1c3d2062339aed2025-01-26T12:37:45ZengSpringerOpenJournal of Big Data2196-11152025-01-0112114310.1186/s40537-025-01066-0Binary plant rhizome growth-based optimization algorithm: an efficient high-dimensional feature selection approachJin Zhang0Fu Yan1Jianqiang Yang2School of Mathematics and Statistics, Guizhou UniversityGuizhou Big Data Academy, Guizhou UniversitySchool of Mathematics and Statistics, Guizhou UniversityAbstract Feature selection is a pivotal research area within machine learning, tasked with pinpointing the essential subset of features from a broad array that critically influences a model’s predictive capabilities. This process enhances model precision and drastically lowers the computational demands associated with training and predicting. Consequently, more advanced optimization techniques are employed to address the challenge of feature selection. This paper introduces an innovative intelligent optimization algorithm, the Plant Root Growth Optimization (PRGO) algorithm, inspired by the structure of plant rhizomes and the way they absorb nutrients.In the algorithm, the plant rhizomes are divided into two categories, the taproot and the fibrous root.the growth process of the taproot plants is associated with the global exploration search, and the growth process of the fibrous root plants relates to the local exploitation search.The global asymptotic convergence of the algorithm is proved by applying Markov’s correlation theory, and simulation results using CEC2014 and CEC2017 test sets show that the proposed algorithm has excellent performance.Moreover, a binary variant of this algorithm (BPRGO) has been specifically crafted in this research to tackle the complexities of high-dimensional feature selection issues. The algorithm was compared to eight well-known feature selection methods and its performance was evaluated using a variety of evaluation metrics on 16 high-dimensional datasets from the Arizona State University feature selection library. and the performance of the proposed algorithm was evaluated through feature subset size, classification accuracy, fitness value, and F1-score. The experimental results show that BPRGO achieves the best performance, which has stronger feature reduction ability and achieves better overall performance on most datasets. BPRGO can obtain extremely smaller feature subsets while maintaining much higher classification accuracy, and satisfactory F1-score.https://doi.org/10.1186/s40537-025-01066-0Feature selectionBinary plant root growth optimization algorithmHigh dimensional dataOptimization
spellingShingle Jin Zhang
Fu Yan
Jianqiang Yang
Binary plant rhizome growth-based optimization algorithm: an efficient high-dimensional feature selection approach
Journal of Big Data
Feature selection
Binary plant root growth optimization algorithm
High dimensional data
Optimization
title Binary plant rhizome growth-based optimization algorithm: an efficient high-dimensional feature selection approach
title_full Binary plant rhizome growth-based optimization algorithm: an efficient high-dimensional feature selection approach
title_fullStr Binary plant rhizome growth-based optimization algorithm: an efficient high-dimensional feature selection approach
title_full_unstemmed Binary plant rhizome growth-based optimization algorithm: an efficient high-dimensional feature selection approach
title_short Binary plant rhizome growth-based optimization algorithm: an efficient high-dimensional feature selection approach
title_sort binary plant rhizome growth based optimization algorithm an efficient high dimensional feature selection approach
topic Feature selection
Binary plant root growth optimization algorithm
High dimensional data
Optimization
url https://doi.org/10.1186/s40537-025-01066-0
work_keys_str_mv AT jinzhang binaryplantrhizomegrowthbasedoptimizationalgorithmanefficienthighdimensionalfeatureselectionapproach
AT fuyan binaryplantrhizomegrowthbasedoptimizationalgorithmanefficienthighdimensionalfeatureselectionapproach
AT jianqiangyang binaryplantrhizomegrowthbasedoptimizationalgorithmanefficienthighdimensionalfeatureselectionapproach