Current Trends in Class Imbalance Learning for Software Defect Prediction

Software defect prediction is of high importance to manage the software development efforts by focusing the testing efforts on the fault-prone modules. Imbalanced defect data causes detrimental impact on the performance of software defect predictors. Researchers deployed a diverse range of learning...

Full description

Saved in:
Bibliographic Details
Main Author: Somya R. Goyal
Format: Article
Language:English
Published: IEEE 2025-01-01
Series:IEEE Access
Subjects:
Online Access:https://ieeexplore.ieee.org/document/10847860/
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1832576799617318912
author Somya R. Goyal
author_facet Somya R. Goyal
author_sort Somya R. Goyal
collection DOAJ
description Software defect prediction is of high importance to manage the software development efforts by focusing the testing efforts on the fault-prone modules. Imbalanced defect data causes detrimental impact on the performance of software defect predictors. Researchers deployed a diverse range of learning methods to tackle the class-imbalance issues resulting into the remarkable achievements improving the performance of defect prediction models. An analysis on the current trends addressing the Class Imbalanced Learning methods is essential in domain of SDP. This article presents a review of the contributions made over the period from year 2019 to 2024 towards handling class imbalance while applying learning machines for SDP. This study will provide current market trends to future researchers to handle class imbalance. This review has uncovered that – The most pronounced datasets are Promise and NASA which are highly imbalanced in nature. Hybridization of data-sampling techniques with ensembles is effective to handle the class imbalance issue. Among the available evaluation metrics, Area under the Curve is the most used one as it is insulated from the impact of imbalanced datasets. Deep learning models have potential and prospects to be explored for class imbalance handling in SDP in a full-fledged capacity.
format Article
id doaj-art-fa7c7e71359047b9b95df10ea2d48f61
institution Kabale University
issn 2169-3536
language English
publishDate 2025-01-01
publisher IEEE
record_format Article
series IEEE Access
spelling doaj-art-fa7c7e71359047b9b95df10ea2d48f612025-01-31T00:01:24ZengIEEEIEEE Access2169-35362025-01-0113168961691710.1109/ACCESS.2025.353225010847860Current Trends in Class Imbalance Learning for Software Defect PredictionSomya R. Goyal0https://orcid.org/0000-0002-0113-7733Department of Computer and Communication Engineering, Manipal University Jaipur, Jaipur, Rajasthan, IndiaSoftware defect prediction is of high importance to manage the software development efforts by focusing the testing efforts on the fault-prone modules. Imbalanced defect data causes detrimental impact on the performance of software defect predictors. Researchers deployed a diverse range of learning methods to tackle the class-imbalance issues resulting into the remarkable achievements improving the performance of defect prediction models. An analysis on the current trends addressing the Class Imbalanced Learning methods is essential in domain of SDP. This article presents a review of the contributions made over the period from year 2019 to 2024 towards handling class imbalance while applying learning machines for SDP. This study will provide current market trends to future researchers to handle class imbalance. This review has uncovered that – The most pronounced datasets are Promise and NASA which are highly imbalanced in nature. Hybridization of data-sampling techniques with ensembles is effective to handle the class imbalance issue. Among the available evaluation metrics, Area under the Curve is the most used one as it is insulated from the impact of imbalanced datasets. Deep learning models have potential and prospects to be explored for class imbalance handling in SDP in a full-fledged capacity.https://ieeexplore.ieee.org/document/10847860/Class imbalanceensemble learningevidence based software engineering (EBSE)samplingsoftware defect prediction (SDP)systematic literature review (SLR)
spellingShingle Somya R. Goyal
Current Trends in Class Imbalance Learning for Software Defect Prediction
IEEE Access
Class imbalance
ensemble learning
evidence based software engineering (EBSE)
sampling
software defect prediction (SDP)
systematic literature review (SLR)
title Current Trends in Class Imbalance Learning for Software Defect Prediction
title_full Current Trends in Class Imbalance Learning for Software Defect Prediction
title_fullStr Current Trends in Class Imbalance Learning for Software Defect Prediction
title_full_unstemmed Current Trends in Class Imbalance Learning for Software Defect Prediction
title_short Current Trends in Class Imbalance Learning for Software Defect Prediction
title_sort current trends in class imbalance learning for software defect prediction
topic Class imbalance
ensemble learning
evidence based software engineering (EBSE)
sampling
software defect prediction (SDP)
systematic literature review (SLR)
url https://ieeexplore.ieee.org/document/10847860/
work_keys_str_mv AT somyargoyal currenttrendsinclassimbalancelearningforsoftwaredefectprediction