August 6, 2025Open Access

Fine-Tuning Methods and Dataset Structures for Multilingual Neural Machine Translation: A Kazakh–English–Russian Case Study in the IT Domain

Key Points

Single-stage fine-tuning outperforms two-stage approaches for low-resource pairs like Kazakh to Russian.
A 50,000-triplet dataset in the IT domain enhances translation consistency compared to non-triplet structures.
Evaluation metrics such as BLEU and chrF are used to assess translation quality across different methods.
The framework provides strategies for applying neural machine translation to low-resource languages effectively.

Abstract

This study explores fine-tuning methods and dataset structures for multilingual neural machine translation using the No Language Left Behind model, with a case study on Kazakh, English, and Russian. We compare single-stage and two-stage fine-tuning approaches, as well as triplet versus non-triplet dataset configurations, to improve translation quality. A high-quality, 50,000-triplet dataset in information technology domain, manually translated and expert-validated, serves as the in-domain benchmark, complemented by out-of-domain corpora like KazParC. Evaluations using BLEU, chrF, METEOR, and TER metrics reveal that single-stage fine-tuning excels for low-resource pairs (e.g., 0.48 BLEU, 0.77 chrF for Kazakh → Russian), while two-stage fine-tuning benefits high-resource pairs (Russian → English). Triplet datasets improve cross-linguistic consistency compared with non-triplet structures. Our reproducible framework offers practical guidance for adapting neural machine translation to technical domains and low-resource languages.

Read Full Paperexternally

KI fragen

Bookmark

View Full Paper