Computational linguistics and natural language processing techniques for semantic field extraction in Arabic online news

The research aimed to extract semantic fields from Arabic online news and advance Natural Language Processing (NLP) applications in understanding and managing news information effectively. It provides a comprehensive approach to processing and analyzing large volumes of Arabic news data by integrati...

Full description

Saved in:
Bibliographic Details
Main Authors: Maulana Ihsan Ahmad, Moh. Kanif Anwari
Format: Article
Language:English
Published: Universitas Syiah Kuala 2024-09-01
Series:Studies in English Language and Education
Subjects:
Online Access:https://jurnal.usk.ac.id/SiELE/article/view/38090
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1832583568831807488
author Maulana Ihsan Ahmad
Moh. Kanif Anwari
author_facet Maulana Ihsan Ahmad
Moh. Kanif Anwari
author_sort Maulana Ihsan Ahmad
collection DOAJ
description The research aimed to extract semantic fields from Arabic online news and advance Natural Language Processing (NLP) applications in understanding and managing news information effectively. It provides a comprehensive approach to processing and analyzing large volumes of Arabic news data by integrating semantic field analysis, NLP, and computational linguistics. Using quantitative methods, Arabic news articles were collected and processed with Python, a popular programming language in data analysis, and applied various NLP techniques and machine learning models to accurately extract semantic fields. The primary objective was to evaluate the effectiveness of different classification models in categorizing Arabic news and to identify the most suitable model for semantic field extraction. The research evaluated five classification models: Naive Bayes, Support Vector Machine (SVM), Logistic Regression, Random Forest, and Gradient Boosting. Among these, SVM achieves the highest overall accuracy of 90%. Specifically, SVM demonstrated exceptional performance in categorizing sports-related news, with a 99% probability and an F1-Score of 98%. However, it faced challenges in categorizing health and science news, achieving a lower F1-Score of 79%. Overall, the study demonstrated the effectiveness of computational methods, particularly SVM, in classifying Arabic news and extracting semantic fields, thereby advancing NLP and computational linguistics. The findings highlighted the potential of SVM for accurate news analysis and the need for further enhancement of NLP techniques to address multilingual and domain-specific challenges.
format Article
id doaj-art-f1691625efbd41b6bcf1728806b17fd2
institution Kabale University
issn 2355-2794
2461-0275
language English
publishDate 2024-09-01
publisher Universitas Syiah Kuala
record_format Article
series Studies in English Language and Education
spelling doaj-art-f1691625efbd41b6bcf1728806b17fd22025-01-28T10:47:38ZengUniversitas Syiah KualaStudies in English Language and Education2355-27942461-02752024-09-011131685170910.24815/siele.v11i3.3809018962Computational linguistics and natural language processing techniques for semantic field extraction in Arabic online newsMaulana Ihsan Ahmad0Moh. Kanif Anwari1State Islamic University of Sunan KalijagaState Islamic University of Sunan KalijagaThe research aimed to extract semantic fields from Arabic online news and advance Natural Language Processing (NLP) applications in understanding and managing news information effectively. It provides a comprehensive approach to processing and analyzing large volumes of Arabic news data by integrating semantic field analysis, NLP, and computational linguistics. Using quantitative methods, Arabic news articles were collected and processed with Python, a popular programming language in data analysis, and applied various NLP techniques and machine learning models to accurately extract semantic fields. The primary objective was to evaluate the effectiveness of different classification models in categorizing Arabic news and to identify the most suitable model for semantic field extraction. The research evaluated five classification models: Naive Bayes, Support Vector Machine (SVM), Logistic Regression, Random Forest, and Gradient Boosting. Among these, SVM achieves the highest overall accuracy of 90%. Specifically, SVM demonstrated exceptional performance in categorizing sports-related news, with a 99% probability and an F1-Score of 98%. However, it faced challenges in categorizing health and science news, achieving a lower F1-Score of 79%. Overall, the study demonstrated the effectiveness of computational methods, particularly SVM, in classifying Arabic news and extracting semantic fields, thereby advancing NLP and computational linguistics. The findings highlighted the potential of SVM for accurate news analysis and the need for further enhancement of NLP techniques to address multilingual and domain-specific challenges.https://jurnal.usk.ac.id/SiELE/article/view/38090computational linguisticsnatural language processingsemantic field
spellingShingle Maulana Ihsan Ahmad
Moh. Kanif Anwari
Computational linguistics and natural language processing techniques for semantic field extraction in Arabic online news
Studies in English Language and Education
computational linguistics
natural language processing
semantic field
title Computational linguistics and natural language processing techniques for semantic field extraction in Arabic online news
title_full Computational linguistics and natural language processing techniques for semantic field extraction in Arabic online news
title_fullStr Computational linguistics and natural language processing techniques for semantic field extraction in Arabic online news
title_full_unstemmed Computational linguistics and natural language processing techniques for semantic field extraction in Arabic online news
title_short Computational linguistics and natural language processing techniques for semantic field extraction in Arabic online news
title_sort computational linguistics and natural language processing techniques for semantic field extraction in arabic online news
topic computational linguistics
natural language processing
semantic field
url https://jurnal.usk.ac.id/SiELE/article/view/38090
work_keys_str_mv AT maulanaihsanahmad computationallinguisticsandnaturallanguageprocessingtechniquesforsemanticfieldextractioninarabiconlinenews
AT mohkanifanwari computationallinguisticsandnaturallanguageprocessingtechniquesforsemanticfieldextractioninarabiconlinenews