TQFLL: a novel unified analytics framework for translation quality framework for large language model and human translation of allusions in multilingual corpora

In large language models (LLMs), the translation quality has limitations in the translation when translated into different languages. This study compares Chinese allusions in human and machine translated corpora translated by OpenAI GPT-3.5, Volctrans, and human translated texts. The framework innov...

Full description

Saved in:
Bibliographic Details
Main Authors: Li Yating, Muhammad Afzaal, Xiao Shanshan, Dina Abdel Salam El-Dakhs
Format: Article
Language:English
Published: Taylor & Francis Group 2025-01-01
Series:Automatika
Subjects:
Online Access:https://www.tandfonline.com/doi/10.1080/00051144.2024.2447652
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:In large language models (LLMs), the translation quality has limitations in the translation when translated into different languages. This study compares Chinese allusions in human and machine translated corpora translated by OpenAI GPT-3.5, Volctrans, and human translated texts. The framework innovatively combines two automated evaluation metrics, BLEU and METEOR, with a translation quality assessment method derived from Fuzzy Mathematics and Optimality Theory. The findings of the study indicate that the GPT-3.5 translated version exhibits higher quality than the Volctrans version when evaluated by a machine. Similarly, human evaluations indicate that among the three versions, Volctrans is of the lowest quality, while the human version exceeds the GPT-3.5 version in terms of quality. Thus, the study further reveals that Volctrans version is deemed to have the lowest translation quality from both human and machine evaluation perspectives. Finally, this study not only introduces but also validates a novel framework for assessing machine translation quality.
ISSN:0005-1144
1848-3380