What type of study is this?

This is a Experimental Study study.

What question did this study set out to answer?

This research aims to analyze the impact of tokenization strategies on English to Arabic neural machine translation under low-resource conditions.

February 16, 2026Open Access

An Empirical Study of Transformer-Based Neural Machine Translation for English to Arabic

Puntos clave

This research aims to analyze the impact of tokenization strategies on English to Arabic neural machine translation under low-resource conditions.
Conducted controlled experiments comparing various tokenization methods for NMT
Used Tiny Transformer model for consistent architecture and training procedures
Assessed translation quality using metrics such as BLEU and BERTScore.
Tokenization choice critically influences translation quality and optimization dynamics
BLEU metric alone is inadequate for evaluating translation quality
Byte-pair encoding achieved optimal performance in low-resource settings.

Resumen

Neural machine translation (NMT) performance is strongly influenced by tokenization strategies, particularly for morphologically rich languages such as Arabic. Despite the importance of tokenization, there is a lack of controlled, reproducible studies examining its impact under low-resource conditions, which limits our understanding of how different methods affect translation quality and training dynamics. This paper presents a controlled experimental study analyzing the impact of different tokenization methods on English → Arabic (EN → AR) translation using a Tiny Transformer model under low-resource conditions. The study aims to provide a systematic and reproducible comparison that isolates the effect of tokenization choices under fixed modeling and training constraints. Experiments are conducted with identical architecture, training steps, decoding procedure, and evaluation pipeline to ensure reproducibility. Translation quality is assessed using multiple metrics including BLEU, ChrF++, TER, and BERTScore, revealing substantial divergences and demonstrating empirically, in the context of low-resource Arabic NMT, that BLEU alone is insufficient for reliable evaluation. While the limitations of BLEU are known in general, our results provide new evidence showing that, under low-resource conditions and across different tokenization strategies, reliance on BLEU can lead to misleading conclusions about translation quality. Training dynamics are analyzed using TensorBoard, linking tokenization strategies to differences in convergence, saturation, and stability. For validation, a small-scale English → German (EN → DE) experiment confirms that the Tiny Transformer setup reproduces expected behavior. The contribution of this work lies in establishing controlled empirical evidence and practical insights, rather than absolute performance gains, for low-resource Arabic NMT. Our results provide controlled evidence that tokenization choice critically affects both translation quality and optimization dynamics, offering practical guidance for low-resource Arabic NMT research. Overall, byte-pair encoding (BPE) achieves the strongest balance across surface-level and semantic metrics under controlled low-resource conditions (BLEU: 8.57, ChrF++: 18.56, TER: 97.38, BERTScore-F1: 0.785). Character-level tokenization yields higher semantic similarity than subword-based methods, as reflected by BERTScore, but remains weaker in structural fidelity and surface-form accuracy, while SentencePiece exhibits intermediate behavior, favoring semantic adequacy over exact n-gram matching. These results confirm that tokenization choice critically influences both evaluation outcomes and optimization behavior, and that BLEU alone is insufficient for assessing Arabic translation quality.

Leer artículo completoexternamente

Me gusta

Guardar

Ver artículo completo

Cite This Study

Alrashidi et al. (Sat,) studied this question.

synapsesocial.com/papers/69926552eb1f82dc367a1474 https://doi.org/https://doi.org/10.3390/info17020198

Me gusta

Guardar

Ver artículo completo