Cyberbullying-Related Automated Hate Speech Detection on Social Media Platforms Using Stack Ensemble Classification Method
Abstract Hate speech (HS) has grown because of increasing social media platform usage, which includes Twitter, YouTube, and Facebook. The frequent attempts to implement automated detection systems remain unsuccessful at separating hate speech from objectionable language, because user-generated conte...
Saved in:
| Main Authors: | , , , , , |
|---|---|
| Format: | Article |
| Language: | English |
| Published: |
Springer
2025-07-01
|
| Series: | International Journal of Computational Intelligence Systems |
| Subjects: | |
| Online Access: | https://doi.org/10.1007/s44196-025-00919-z |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
| _version_ | 1849389585404002304 |
|---|---|
| author | Muhammad Mubeen Aliza Muskan Arslan Akram Javed Rashid Tagrid Abdullah N. Alshalali Nadeem Sarwar |
| author_facet | Muhammad Mubeen Aliza Muskan Arslan Akram Javed Rashid Tagrid Abdullah N. Alshalali Nadeem Sarwar |
| author_sort | Muhammad Mubeen |
| collection | DOAJ |
| description | Abstract Hate speech (HS) has grown because of increasing social media platform usage, which includes Twitter, YouTube, and Facebook. The frequent attempts to implement automated detection systems remain unsuccessful at separating hate speech from objectionable language, because user-generated content tends toward informal, brief, and diverse expressions. The determination of hate speech within texts proves exceptionally hard, since precise context detection is needed to distinguish abusive language from neutral statements. Precision in hate speech identification and filtering stands essential, because these online content forms have negative impacts on both minority and majority groups while heightening their conflicts. The research presents a stacked ensemble classification system that classifies tweets into three groups: hate speech, abusive language, or neutral. The framework uses term frequency–inverse document frequency (TF–IDF) extracted from tweet texts for which support vector machine (SVM), together with Random Forest, XGBoost, and Logistic Regression, base machine learning models function as classifiers. The final model outcome results from linking several base learning models into an ensemble configuration. The Kaggle Hate Speech data set served as training material for the system, because it contained 24,784 tweets along with eight attributes. The model performance received improvement through exclusion of manually derived features. The proposed ensemble model demonstrated superior performance with 96% accuracy, while each single classifier had lower accuracy rates (SVM: 93%, Random Forest: 94%, and XGBoost: 88%). The research outcomes show stacking represents an effective method to enhance systems for detecting hate speech operating on social media platforms. |
| format | Article |
| id | doaj-art-990e83c50eb04e9483d1da2d4e99d1d4 |
| institution | Kabale University |
| issn | 1875-6883 |
| language | English |
| publishDate | 2025-07-01 |
| publisher | Springer |
| record_format | Article |
| series | International Journal of Computational Intelligence Systems |
| spelling | doaj-art-990e83c50eb04e9483d1da2d4e99d1d42025-08-20T03:41:56ZengSpringerInternational Journal of Computational Intelligence Systems1875-68832025-07-0118112410.1007/s44196-025-00919-zCyberbullying-Related Automated Hate Speech Detection on Social Media Platforms Using Stack Ensemble Classification MethodMuhammad Mubeen0Aliza Muskan1Arslan Akram2Javed Rashid3Tagrid Abdullah N. Alshalali4Nadeem Sarwar5Department of Computer Science, University of PeopleDepartment of Computer Science, University of PeopleDepartment of Computer Science, University of PeopleInformation Technology Services, University of OkaraDepartment of Information Systems, College of Computer and Information Sciences, Princess Nourah bint Abdulrahman UniversityDepartment of Computer Science, Bahria University Lahore CampusAbstract Hate speech (HS) has grown because of increasing social media platform usage, which includes Twitter, YouTube, and Facebook. The frequent attempts to implement automated detection systems remain unsuccessful at separating hate speech from objectionable language, because user-generated content tends toward informal, brief, and diverse expressions. The determination of hate speech within texts proves exceptionally hard, since precise context detection is needed to distinguish abusive language from neutral statements. Precision in hate speech identification and filtering stands essential, because these online content forms have negative impacts on both minority and majority groups while heightening their conflicts. The research presents a stacked ensemble classification system that classifies tweets into three groups: hate speech, abusive language, or neutral. The framework uses term frequency–inverse document frequency (TF–IDF) extracted from tweet texts for which support vector machine (SVM), together with Random Forest, XGBoost, and Logistic Regression, base machine learning models function as classifiers. The final model outcome results from linking several base learning models into an ensemble configuration. The Kaggle Hate Speech data set served as training material for the system, because it contained 24,784 tweets along with eight attributes. The model performance received improvement through exclusion of manually derived features. The proposed ensemble model demonstrated superior performance with 96% accuracy, while each single classifier had lower accuracy rates (SVM: 93%, Random Forest: 94%, and XGBoost: 88%). The research outcomes show stacking represents an effective method to enhance systems for detecting hate speech operating on social media platforms.https://doi.org/10.1007/s44196-025-00919-zHate-speechSupport vector machineMachine learningNatural language processingHate-speech prediction |
| spellingShingle | Muhammad Mubeen Aliza Muskan Arslan Akram Javed Rashid Tagrid Abdullah N. Alshalali Nadeem Sarwar Cyberbullying-Related Automated Hate Speech Detection on Social Media Platforms Using Stack Ensemble Classification Method International Journal of Computational Intelligence Systems Hate-speech Support vector machine Machine learning Natural language processing Hate-speech prediction |
| title | Cyberbullying-Related Automated Hate Speech Detection on Social Media Platforms Using Stack Ensemble Classification Method |
| title_full | Cyberbullying-Related Automated Hate Speech Detection on Social Media Platforms Using Stack Ensemble Classification Method |
| title_fullStr | Cyberbullying-Related Automated Hate Speech Detection on Social Media Platforms Using Stack Ensemble Classification Method |
| title_full_unstemmed | Cyberbullying-Related Automated Hate Speech Detection on Social Media Platforms Using Stack Ensemble Classification Method |
| title_short | Cyberbullying-Related Automated Hate Speech Detection on Social Media Platforms Using Stack Ensemble Classification Method |
| title_sort | cyberbullying related automated hate speech detection on social media platforms using stack ensemble classification method |
| topic | Hate-speech Support vector machine Machine learning Natural language processing Hate-speech prediction |
| url | https://doi.org/10.1007/s44196-025-00919-z |
| work_keys_str_mv | AT muhammadmubeen cyberbullyingrelatedautomatedhatespeechdetectiononsocialmediaplatformsusingstackensembleclassificationmethod AT alizamuskan cyberbullyingrelatedautomatedhatespeechdetectiononsocialmediaplatformsusingstackensembleclassificationmethod AT arslanakram cyberbullyingrelatedautomatedhatespeechdetectiononsocialmediaplatformsusingstackensembleclassificationmethod AT javedrashid cyberbullyingrelatedautomatedhatespeechdetectiononsocialmediaplatformsusingstackensembleclassificationmethod AT tagridabdullahnalshalali cyberbullyingrelatedautomatedhatespeechdetectiononsocialmediaplatformsusingstackensembleclassificationmethod AT nadeemsarwar cyberbullyingrelatedautomatedhatespeechdetectiononsocialmediaplatformsusingstackensembleclassificationmethod |