ENSEMBLE RESAMPLING SUPPORT VECTOR MACHINE, MULTINOMIAL REGRESSION TO MULTICLASS IMBALANCED DATA

Imbalanced data is a commonly encountered issue in classification analysis. This issue gives rise to prediction errors in the classification process, which in turn affects the sensitivity, particularly in the minority class. Resampling techniques can be employed as a means to mitigate the issue of I...

Full description

Saved in:
Bibliographic Details
Main Authors: Laila Qadrini, Hikmah Hikmah, Elviani Tande, Ignasius Presda, Aulia Atika Maghfirah, Nilawati Nilawati, Handayani Handayani
Format: Article
Language:English
Published: Universitas Pattimura 2024-03-01
Series:Barekeng
Subjects:
Online Access:https://ojs3.unpatti.ac.id/index.php/barekeng/article/view/10285
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:Imbalanced data is a commonly encountered issue in classification analysis. This issue gives rise to prediction errors in the classification process, which in turn affects the sensitivity, particularly in the minority class. Resampling techniques can be employed as a means to mitigate the issue of Imbalanced data. Furthermore, ensemble approaches are Utilized in the classification procedure to augment the performance of classification. The present study assesses the efficacy of the bagging ensemble approach in conjunction with ADASYN as a means of addressing the aforementioned issue. The dataset Utilized in this work comprises Imbalanced Glass Identification data, Imbalanced Iris data, and Imbalanced synthetic data. The study Centres on the Utilization of Support Vector Machines (SVM) with parameter optimization using repeated cross-validation (k = 10) and the application of multinomial regression. The evaluation of classification outcomes involves a comparison between the ensemble technique and multinomial regression. This comparison is conducted under pre- and post-resampling conditions, with the evaluation metrics being accuracy, sensitivity, and specificity. The analysis of classification outcomes across the three datasets suggests that the ensemble resampling SVM approach and multinomial regression exhibit superior performance compared to the ensemble SVM and multinomial regression approaches when applied to non-resampled data. Resampling of data has been observed to enhance sensitivity, particularly in the minority class.
ISSN:1978-7227
2615-3017