Cyberbullying-Related Automated Hate Speech Detection on Social Media Platforms Using Stack Ensemble Classification Method

Abstract Hate speech (HS) has grown because of increasing social media platform usage, which includes Twitter, YouTube, and Facebook. The frequent attempts to implement automated detection systems remain unsuccessful at separating hate speech from objectionable language, because user-generated conte...

Full description

Saved in:
Bibliographic Details
Main Authors: Muhammad Mubeen, Aliza Muskan, Arslan Akram, Javed Rashid, Tagrid Abdullah N. Alshalali, Nadeem Sarwar
Format: Article
Language:English
Published: Springer 2025-07-01
Series:International Journal of Computational Intelligence Systems
Subjects:
Online Access:https://doi.org/10.1007/s44196-025-00919-z
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1849389585404002304
author Muhammad Mubeen
Aliza Muskan
Arslan Akram
Javed Rashid
Tagrid Abdullah N. Alshalali
Nadeem Sarwar
author_facet Muhammad Mubeen
Aliza Muskan
Arslan Akram
Javed Rashid
Tagrid Abdullah N. Alshalali
Nadeem Sarwar
author_sort Muhammad Mubeen
collection DOAJ
description Abstract Hate speech (HS) has grown because of increasing social media platform usage, which includes Twitter, YouTube, and Facebook. The frequent attempts to implement automated detection systems remain unsuccessful at separating hate speech from objectionable language, because user-generated content tends toward informal, brief, and diverse expressions. The determination of hate speech within texts proves exceptionally hard, since precise context detection is needed to distinguish abusive language from neutral statements. Precision in hate speech identification and filtering stands essential, because these online content forms have negative impacts on both minority and majority groups while heightening their conflicts. The research presents a stacked ensemble classification system that classifies tweets into three groups: hate speech, abusive language, or neutral. The framework uses term frequency–inverse document frequency (TF–IDF) extracted from tweet texts for which support vector machine (SVM), together with Random Forest, XGBoost, and Logistic Regression, base machine learning models function as classifiers. The final model outcome results from linking several base learning models into an ensemble configuration. The Kaggle Hate Speech data set served as training material for the system, because it contained 24,784 tweets along with eight attributes. The model performance received improvement through exclusion of manually derived features. The proposed ensemble model demonstrated superior performance with 96% accuracy, while each single classifier had lower accuracy rates (SVM: 93%, Random Forest: 94%, and XGBoost: 88%). The research outcomes show stacking represents an effective method to enhance systems for detecting hate speech operating on social media platforms.
format Article
id doaj-art-990e83c50eb04e9483d1da2d4e99d1d4
institution Kabale University
issn 1875-6883
language English
publishDate 2025-07-01
publisher Springer
record_format Article
series International Journal of Computational Intelligence Systems
spelling doaj-art-990e83c50eb04e9483d1da2d4e99d1d42025-08-20T03:41:56ZengSpringerInternational Journal of Computational Intelligence Systems1875-68832025-07-0118112410.1007/s44196-025-00919-zCyberbullying-Related Automated Hate Speech Detection on Social Media Platforms Using Stack Ensemble Classification MethodMuhammad Mubeen0Aliza Muskan1Arslan Akram2Javed Rashid3Tagrid Abdullah N. Alshalali4Nadeem Sarwar5Department of Computer Science, University of PeopleDepartment of Computer Science, University of PeopleDepartment of Computer Science, University of PeopleInformation Technology Services, University of OkaraDepartment of Information Systems, College of Computer and Information Sciences, Princess Nourah bint Abdulrahman UniversityDepartment of Computer Science, Bahria University Lahore CampusAbstract Hate speech (HS) has grown because of increasing social media platform usage, which includes Twitter, YouTube, and Facebook. The frequent attempts to implement automated detection systems remain unsuccessful at separating hate speech from objectionable language, because user-generated content tends toward informal, brief, and diverse expressions. The determination of hate speech within texts proves exceptionally hard, since precise context detection is needed to distinguish abusive language from neutral statements. Precision in hate speech identification and filtering stands essential, because these online content forms have negative impacts on both minority and majority groups while heightening their conflicts. The research presents a stacked ensemble classification system that classifies tweets into three groups: hate speech, abusive language, or neutral. The framework uses term frequency–inverse document frequency (TF–IDF) extracted from tweet texts for which support vector machine (SVM), together with Random Forest, XGBoost, and Logistic Regression, base machine learning models function as classifiers. The final model outcome results from linking several base learning models into an ensemble configuration. The Kaggle Hate Speech data set served as training material for the system, because it contained 24,784 tweets along with eight attributes. The model performance received improvement through exclusion of manually derived features. The proposed ensemble model demonstrated superior performance with 96% accuracy, while each single classifier had lower accuracy rates (SVM: 93%, Random Forest: 94%, and XGBoost: 88%). The research outcomes show stacking represents an effective method to enhance systems for detecting hate speech operating on social media platforms.https://doi.org/10.1007/s44196-025-00919-zHate-speechSupport vector machineMachine learningNatural language processingHate-speech prediction
spellingShingle Muhammad Mubeen
Aliza Muskan
Arslan Akram
Javed Rashid
Tagrid Abdullah N. Alshalali
Nadeem Sarwar
Cyberbullying-Related Automated Hate Speech Detection on Social Media Platforms Using Stack Ensemble Classification Method
International Journal of Computational Intelligence Systems
Hate-speech
Support vector machine
Machine learning
Natural language processing
Hate-speech prediction
title Cyberbullying-Related Automated Hate Speech Detection on Social Media Platforms Using Stack Ensemble Classification Method
title_full Cyberbullying-Related Automated Hate Speech Detection on Social Media Platforms Using Stack Ensemble Classification Method
title_fullStr Cyberbullying-Related Automated Hate Speech Detection on Social Media Platforms Using Stack Ensemble Classification Method
title_full_unstemmed Cyberbullying-Related Automated Hate Speech Detection on Social Media Platforms Using Stack Ensemble Classification Method
title_short Cyberbullying-Related Automated Hate Speech Detection on Social Media Platforms Using Stack Ensemble Classification Method
title_sort cyberbullying related automated hate speech detection on social media platforms using stack ensemble classification method
topic Hate-speech
Support vector machine
Machine learning
Natural language processing
Hate-speech prediction
url https://doi.org/10.1007/s44196-025-00919-z
work_keys_str_mv AT muhammadmubeen cyberbullyingrelatedautomatedhatespeechdetectiononsocialmediaplatformsusingstackensembleclassificationmethod
AT alizamuskan cyberbullyingrelatedautomatedhatespeechdetectiononsocialmediaplatformsusingstackensembleclassificationmethod
AT arslanakram cyberbullyingrelatedautomatedhatespeechdetectiononsocialmediaplatformsusingstackensembleclassificationmethod
AT javedrashid cyberbullyingrelatedautomatedhatespeechdetectiononsocialmediaplatformsusingstackensembleclassificationmethod
AT tagridabdullahnalshalali cyberbullyingrelatedautomatedhatespeechdetectiononsocialmediaplatformsusingstackensembleclassificationmethod
AT nadeemsarwar cyberbullyingrelatedautomatedhatespeechdetectiononsocialmediaplatformsusingstackensembleclassificationmethod