An enhanced pivot-based neural machine translation for low-resource languages

Bibliographic Details
Main Authors: Danang Arbian Sulistyo, Aji Prasetya Wibawa, Didik Dwi Prasetya, Fadhli Almu'iini Ahda
Format: Article
Language: English
Published: Universitas Ahmad Dahlan 2025-05-01
Series: IJAIN (International Journal of Advances in Intelligent Informatics)
Online Access: https://ijain.org/index.php/IJAIN/article/view/2115
Description
Summary: This study examines whether using Indonesian as an intermediary (pivot) language improves the quality of Javanese-to-Madurese translation in a pivot-based neural machine translation (NMT) approach. The principal objective is to improve translation accuracy and consistency between these low-resource languages, and thereby to advance machine translation models for underrepresented languages. Parallel texts were extracted from internet sources and pre-processed through tokenization, normalization, and stop-word removal; the resulting parallel corpora were used to train and evaluate the NMT models. An intermediate translation step through Indonesian is inserted between Javanese and Madurese to improve the accuracy and consistency of the output. The pivot-based strategy consistently outperformed direct translation on BLEU scores for all n-gram orders (BLEU-1 to BLEU-4). The higher BLEU scores indicate better vocabulary selection, better preservation of context, and greater overall comprehensibility. The study contributes to the literature on machine translation and computational linguistics, especially for low-resource languages, by demonstrating the practical effectiveness of a pivot-based method for improving translation accuracy. The method's reliability and effectiveness in producing accurate translations were demonstrated across multiple experiments. Although the pivot-based technique improves translation quality, it has limitations, including the risk of error propagation and of bias originating from the pivot language. Further research is needed to examine whether integrating named entity recognition (NER) can improve accuracy and optimize the intermediate translation step. This work advances machine translation and the preservation of low-resource languages, with practical implications for multilingual communities, language education resources, and cultural conservation.
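The two technical ideas named in the summary, chaining translation through a pivot language and scoring output with BLEU-1 to BLEU-4, can be illustrated with a minimal Python sketch. Everything in it is an assumption made for illustration: the record does not describe the authors' models or evaluation script, so the two translators are hypothetical lookup tables over placeholder tokens (not real Javanese, Indonesian, or Madurese words), and `bleu` is a plain cumulative sentence-level BLEU with one reference and no smoothing, which may differ from the variant the article reports.

```python
import math
from collections import Counter
from typing import Callable, Dict, List

Translator = Callable[[str], str]


def lookup_translator(table: Dict[str, str]) -> Translator:
    """Hypothetical stand-in for a trained NMT model: word-by-word lookup."""
    def translate(text: str) -> str:
        return " ".join(table.get(tok, tok) for tok in text.split())
    return translate


def pivot_translate(text: str, src_to_pivot: Translator,
                    pivot_to_tgt: Translator) -> str:
    """Translate source -> pivot -> target by chaining two models.

    Any mistake made in the first hop is passed to the second hop,
    which is the error-propagation risk noted in the summary.
    """
    return pivot_to_tgt(src_to_pivot(text))


def ngrams(tokens: List[str], n: int) -> Counter:
    """Count all contiguous n-grams in a token list."""
    return Counter(tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1))


def bleu(candidate: List[str], reference: List[str], max_n: int) -> float:
    """Cumulative BLEU-max_n: brevity penalty times the geometric mean
    of clipped n-gram precisions for n = 1..max_n (no smoothing)."""
    if not candidate:
        return 0.0
    log_sum = 0.0
    for n in range(1, max_n + 1):
        cand, ref = ngrams(candidate, n), ngrams(reference, n)
        total = sum(cand.values())
        clipped = sum(min(count, ref[gram]) for gram, count in cand.items())
        if total == 0 or clipped == 0:
            return 0.0  # unsmoothed BLEU is zero if any precision is zero
        log_sum += math.log(clipped / total)
    c, r = len(candidate), len(reference)
    brevity_penalty = 1.0 if c > r else math.exp(1 - r / c)
    return brevity_penalty * math.exp(log_sum / max_n)


# Placeholder vocabularies (assumed, not real language data).
JV_TO_ID = {"jv_a": "id_a", "jv_b": "id_b"}
ID_TO_MAD = {"id_a": "mad_a", "id_b": "mad_b"}

output = pivot_translate("jv_a jv_b",
                         lookup_translator(JV_TO_ID),
                         lookup_translator(ID_TO_MAD))
scores = [bleu("a b x d".split(), "a b c d".split(), n) for n in range(1, 5)]
```

In this toy run, `output` is the placeholder string `mad_a mad_b`, and `scores` holds BLEU-1 through BLEU-4 for a four-token candidate with one wrong word. The higher-order scores drop to zero because no trigram matches, which is why practical evaluations usually apply smoothing or score at corpus level.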
ISSN: 2442-6571, 2548-3161