How well can LLMs grade essays in Arabic?

How well can LLMs grade essays in Arabic?

This research assesses the effectiveness of state-of-the-art large language models (LLMs), including ChatGPT, Llama, Aya, Jais, and ACEGPT, in the task of Arabic automated essay scoring (AES) using the AR-AES dataset. It explores various evaluation methodologies, including zero-shot, few-shot in con...

Full description

Saved in:

Bibliographic Details
Main Authors:	Rayed Ghazawi, Edwin Simpson
Format:	Article
Language:	English
Published:	Elsevier 2025-12-01
Series:	Computers and Education: Artificial Intelligence
Subjects:	Automatic essay scoring (AES) Natural language processing (NLP) Large language models (LLMs) Arabic language
Online Access:	http://www.sciencedirect.com/science/article/pii/S2666920X2500089X
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Integrating Large Language Models in Political Discourse Studies on Social Media: Challenges of Validating an LLMs-in-the-loop Pipeline
by: Giada Marino, et al.
Published: (2024-10-01)

ET-GNN: Ensemble Transformer-Based Graph Neural Networks for Holistic Automated Essay Scoring
by: Hind Aljuaid, et al.
Published: (2025-01-01)

Assisting tool for essay grading for Turkish language instructors
by: Mustafa Alp Çetin, et al.
Published: (2019-12-01)

BERTugues: A Novel BERT Transformer Model Pre-trained for Brazilian Portuguese
by: Ricardo Mazza Zago, et al.
Published: (2024-12-01)

CacheFormer: High-Attention-Based Segment Caching
by: Sushant Singh, et al.
Published: (2025-04-01)

In-Context Learning in Large Language Models (LLMs): Mechanisms, Capabilities, and Implications for Advanced Knowledge Representation and Reasoning
by: Azza Mohamed, et al.
Published: (2025-01-01)

An LLM-based hybrid approach for enhanced automated essay scoring
by: John Atkinson, et al.
Published: (2025-04-01)

Editorial: Large Language Models for medical applications
by: Ariel Soares Teles, et al.
Published: (2025-05-01)

Systematic Analysis of Retrieval-Augmented Generation-Based LLMs for Medical Chatbot Applications
by: Arunabh Bora, et al.
Published: (2024-10-01)

AutoTA: A Dynamic Intent-Based Virtual Teaching Assistant for Students Using Open Source LLMs
by: Rajashree Dahal, et al.
Published: (2025-01-01)

Evaluating large language models for renal colic imaging recommendations: a comparative analysis of Gemini, copilot, and ChatGPT-4.0
by: Yavuz Yigit, et al.
Published: (2025-07-01)

Supporting energy policy research with large language models: A case study in wind energy siting ordinances
by: Grant Buster, et al.
Published: (2024-12-01)

Statistics is not measurement: The inbuilt semantics of psychometric scales and language-based models obscures crucial epistemic differences
by: Jana Uher
Published: (2025-06-01)

Automated essay scoring with SBERT embeddings and LSTM-Attention networks
by: Yuzhe Nie
Published: (2025-02-01)

Large language models in breast cancer reconstruction: A framework for patient-specific recovery and predictive insights
by: Chunrao Zheng, et al.
Published: (2025-06-01)

Combining the Strengths of LLMs and Persuasive Technology to Combat Cyberhate
by: Malik Almaliki, et al.
Published: (2025-05-01)

Constructing and evaluating ArabicStanceX: a social media dataset for Arabic stance detection
by: Ali Alkhathlan, et al.
Published: (2025-06-01)

Developing a Multi-Layer Ontology Construction Framework for Arabic Language Processing: Focus on Figurative Language Potential
by: Zouheir Banou, et al.
Published: (2025-01-01)

Towards a benchmark dataset for large language models in the context of process automation
by: Tejennour Tizaoui, et al.
Published: (2024-12-01)

Toward HydroLLM: a benchmark dataset for hydrology-specific knowledge assessment for large language models
by: Dilara Kizilkaya, et al.
Published: (2025-01-01)

Large Language Models (LLMs) and Causality Extraction from Text
by: Wlodek Zadrozny
Published: (2025-05-01)

Addressing Activation Outliers in LLMs: A Systematic Review of Post-Training Quantization Techniques
by: Patrik Czako, et al.
Published: (2025-01-01)

LLMs in Cyber Security: Bridging Practice and Education
by: Hany F. Atlam
Published: (2025-07-01)

Large Language Models (LLMs) as Traffic Control Systems at Urban Intersections: A New Paradigm
by: Sari Masri, et al.
Published: (2025-01-01)

LLMs in Education: Evaluation GPT and BERT Models in Student Comment Classification
by: Anabel Pilicita, et al.
Published: (2025-05-01)

Leveraging LLMs for COVID-19 Fake News Generation and Detection: A Comparative Analysis on Twitter Data
by: Hong N. Dao, et al.
Published: (2025-01-01)

Max–Min semantic chunking of documents for RAG application
by: Csaba Kiss, et al.
Published: (2025-06-01)

Can open source large language models be used for tumor documentation in Germany?—An evaluation on urological doctors’ notes
by: Stefan Lenz, et al.
Published: (2025-07-01)

Can AI provide useful holistic essay scoring?
by: Tamara P. Tate, et al.
Published: (2024-12-01)

On protecting the data privacy of Large Language Models (LLMs) and LLM agents: A literature review
by: Biwei Yan, et al.
Published: (2025-06-01)

Comparative Analysis of Traditional and Modern NLP Techniques on the CoLA Dataset: From POS Tagging to Large Language Models
by: Abdessamad Benlahbib, et al.
Published: (2025-01-01)

Potentials and Challenges of Large Language Models (LLMs) in the Context of Administrative Decision-Making
by: Paulina Jo Pesch, et al.
Published: (2025-03-01)

AI driven cardiovascular risk prediction using NLP and Large Language Models for personalized medicine in athletes
by: Ang Li, et al.
Published: (2025-06-01)

Smart Building Recommendations with LLMs: A Semantic Comparison Approach
by: Ioannis Papaioannou, et al.
Published: (2025-06-01)

A Bibliometric Exposition and Review on Leveraging LLMs for Programming Education
by: Joanah Pwanedo Amos, et al.
Published: (2025-01-01)

Similarities And Differences Between Gpsg And Hpsg Grammars Applied To The Arabic Language
by: Abdelmadjid Achit, et al.
Published: (2011-12-01)

Privacy-Preserving Healthcare Data Interactions: A Multi-Agent Approach Using LLMs
by: Carmen De Maio, et al.
Published: (2025-03-01)

A benchmark dataset of narrative student essays with multi-competency grades for automatic essay scoring in Brazilian PortugueseKaggle
by: Hilário Oliveira, et al.
Published: (2025-06-01)

Development and Evaluation of Learning Portfolio Query System Based on LangChain Framework
by: Nien-Lin Hsueh, et al.
Published: (2025-04-01)

محمود تیمور کأدیب اسلامی
by: Shafiqa Bushra, et al.
Published: (2021-02-01)