Urdu Toxic Comment Classification With PURUTT Corpus Development
This study addresses the critical gap in toxic comment classification in Urdu, a widely spoken language devoid of high-quality standard datasets. To address this gap, we employed an existing labeled Roman Urdu (RU) corpus, which was developed originally for Roman Urdu toxic comment classification, a...
Saved in:
Main Authors: | Hafiz Hassaan Saeed, Tahir Khalil, Faisal Kamiran |
---|---|
Format: | Article |
Language: | English |
Published: |
IEEE
2025-01-01
|
Series: | IEEE Access |
Subjects: | |
Online Access: | https://ieeexplore.ieee.org/document/10856102/ |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
-
UEF-HOCUrdu: Unified Embeddings Ensemble Framework for Hate and Offensive Text Classification in Urdu
by: Kifayat Ullah, et al.
Published: (2025-01-01) -
A RHETORICAL ANALYSIS OF COMMENTS AND DELIVERY STRATEGY ON TED TALKS
by: Ildi Kurniawan
Published: (2021-02-01) -
UAlpha40: A comprehensive dataset of Urdu alphabet for Pakistan sign languageMendeley Data
by: Sameena Javaid, et al.
Published: (2025-04-01) -
Urdu Lip Reading Systems for Digits in Controlled and Uncontrolled Environment
by: Amanullah Baloch, et al.
Published: (2025-01-01) -
Maternal exposure to tris (2-butoxyethyl) phosphate induces F0 female reproductive toxicity and offspring developmental toxicity in zebrafish
by: Anqi Dong, et al.
Published: (2025-01-01)